Tesseract OCR not recognizing PDFs in Japanese

Hello, I've been using the Zotero OCR plugin with Tesseract OCR. It works wonderfully for PDFs in English, but when I try it on PDFs in Japanese, the OCR returns gibberish. As far as I can tell from the documentation and previous discussions here, Tesseract supports Japanese and should be able to recognize it. Is there a setting I'm missing?

Thank you!
