You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'd like to suggest adding the possibility of selecting more than one OCR language.
Talking about my personal experience, the text of the vast majority of my images is in English or Spanish, so it'd save me time if I could just have both Spanish and English permanently selected unless I'm sure that I'll stick to one language for a good while.
In many cases, switching from one language to another for each individual image might not be worth it, and this multilanguage support would prove even more useful when using the Extract Images from Files in Folder tool, as you may be dealing with many different images, each in a particular language.
I can also think of a scenario where a number of images contain multilingual text, but this might not happen frequently.
As I rarely use a different language from Spanish and English, this feature would be good enough for me, but another (complementary) approach to consider would be to add an automatic mode where the OCR engine detects the language. I'm not sure that I have seen this in Tesseract, but I saw it in Whisper the other day and I thought it was a good idea.
Anyway, these are my suggestions, hoping that you may take them into account for a future release.
Thank you for your time.
The text was updated successfully, but these errors were encountered:
Just to layout what is possible vs what would be ideal. Tesseract could do multi-language, but the Windows OCR API cannot do multi-language. So if this feature was implemented it would only be possible during FullScreen Grab and batch processing through the Edit Text Window.
Hello,
I'd like to suggest adding the possibility of selecting more than one OCR language.
Talking about my personal experience, the text of the vast majority of my images is in English or Spanish, so it'd save me time if I could just have both Spanish and English permanently selected unless I'm sure that I'll stick to one language for a good while.
In many cases, switching from one language to another for each individual image might not be worth it, and this multilanguage support would prove even more useful when using the Extract Images from Files in Folder tool, as you may be dealing with many different images, each in a particular language.
I can also think of a scenario where a number of images contain multilingual text, but this might not happen frequently.
As I rarely use a different language from Spanish and English, this feature would be good enough for me, but another (complementary) approach to consider would be to add an automatic mode where the OCR engine detects the language. I'm not sure that I have seen this in Tesseract, but I saw it in Whisper the other day and I thought it was a good idea.
Anyway, these are my suggestions, hoping that you may take them into account for a future release.
Thank you for your time.
The text was updated successfully, but these errors were encountered: