Cypheros Transportstream Forum

English-speaking Support => TS-Doctor 3.x => Topic started by: chemag on November 03, 2020, 13:53:10

Title: OCR and Spanish DVB subtitles
Post by: chemag on November 03, 2020, 13:53:10
When Spanish DVB subtitles are processed with OCR, characters such as ¿, ¡, ó, ú are always recognized incorrectly.

The "DVB Subtitle OCR" window shows OCR language: eng,
even though the default language under "correction behavior" is set to "spanish: castilian".

Can something be done about it?

Thank you.
Title: Re: OCR and Spanish DVB subtitles
Post by: chemag on November 03, 2020, 14:03:55
I'm not able to edit my previous message, so here is an example:

Translated text:
-----------------------------------------------
60
00:05:41,800 --> 00:05:43,800
iSophia!    -------> should be ¡Sophia! (opening exclamation mark ¡, not the vowel i)

61
00:05:47,880 --> 00:05:50,720
5A dc'mde vas?    -------> should be ¿A dónde vas?
A trabajar.

62
00:05:50,720 --> 00:05:53,040
gTu no te relajas?    -------> should be ¿Tu no te relajas?

63
00:05:53,040 --> 00:05:57,560
Estoy ocupada.
Por eso Lo digo. éDonde trabajas?    -------> should be ¿Donde trabajas?
Title: Re: OCR and Spanish DVB subtitles
Post by: Cypheros on November 04, 2020, 10:01:48
The problem is that the DVB language is marked as srd (Sardinian), the OCR language is eng, and the actual language is Spanish.

Try changing the DVB language to spa.
(https://cypheros.de/forum_ger2/index.php?action=dlattach;topic=5016.0;attach=9756;image)
TS-Doctor should then use the correct language preferences for the OCR.

The default language is only used if no language descriptor is present in the recording.
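
To make the precedence described above easier to follow, here is a minimal Python sketch of the selection logic. It is not TS-Doctor's actual code. The function name pick_ocr_language, the set of supported OCR languages, and the fallback to eng for an unsupported code are all assumptions for illustration; the fallback is only a guess that happens to match the behaviour reported in this thread (srd in the recording, eng in the OCR window).

```python
from typing import Optional

# Assumption: the OCR languages available in this hypothetical setup.
SUPPORTED_OCR_LANGUAGES = {"eng", "spa", "deu"}

# Assumption: what the OCR falls back to when the descriptor's language
# has no OCR data. This is a guess, not a confirmed TS-Doctor detail.
FALLBACK_OCR_LANGUAGE = "eng"


def pick_ocr_language(dvb_descriptor: Optional[str], default_language: str) -> str:
    """Return the language code the OCR would use under the stated assumptions.

    dvb_descriptor   -- three-letter language code stored in the recording,
                        or None/empty if the recording has no descriptor
    default_language -- the user's configured default language code
    """
    code = (dvb_descriptor or "").strip().lower()

    # No descriptor in the recording: only now is the default language used.
    if not code:
        return default_language

    # A descriptor is present, so it takes precedence over the default.
    if code in SUPPORTED_OCR_LANGUAGES:
        return code

    # Descriptor present but not usable for OCR (assumed behaviour).
    return FALLBACK_OCR_LANGUAGE


print(pick_ocr_language("srd", "spa"))  # eng: wrong descriptor overrides the default
print(pick_ocr_language("spa", "spa"))  # spa: corrected descriptor
print(pick_ocr_language(None, "spa"))   # spa: no descriptor, default is used
```

The first call mirrors the situation in this thread: the default language is Spanish, but because the recording carries a descriptor (srd), the default is never consulted. Changing the descriptor to spa, as suggested above, is what makes the Spanish settings take effect.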