Zotero OCR on M1 Macbook
I'm wondering if anyone who has the Zotero ocr plugin working on Mac (M1, Big Sur) could help me out. I can't get it working.
I've installed tesseract and poppler with Homebrew, installed the zotero plugin and set the path in the Zotero plugin to:
(/opt/homebrew/Cellar/tesseract/4.1.1
/opt/homebrew/Cellar/poppler/21.03.0_1/bin
I can confirm that this where those files do live.
I've also copied the pdftoppm into /Applications/Zotero.app/Contents/MacOS/pdftoppm according a recommendation elsewhere on this forum, although I have tried to run the ocr with and without this step.
When I run the plugin, an ocr file appears but when I try to open it I get the following error:
Format Error: Not a PDF or corrupted.
PDF.js v2.8.146 (build: 7dd64325d)
Message: Invalid PDF structure.
Help? ...
I've installed tesseract and poppler with Homebrew, installed the zotero plugin and set the path in the Zotero plugin to:
(/opt/homebrew/Cellar/tesseract/4.1.1
/opt/homebrew/Cellar/poppler/21.03.0_1/bin
I can confirm that this where those files do live.
I've also copied the pdftoppm into /Applications/Zotero.app/Contents/MacOS/pdftoppm according a recommendation elsewhere on this forum, although I have tried to run the ocr with and without this step.
When I run the plugin, an ocr file appears but when I try to open it I get the following error:
Format Error: Not a PDF or corrupted.
PDF.js v2.8.146 (build: 7dd64325d)
Message: Invalid PDF structure.
Help? ...
-
bwiernikI suggest you post this issue on the Zotero OCR GitHub page.
-
AndrewRRMAh yeh, will do.
-
AndrewRRMAs an update, I gave up with ocr within Zotero and use ocrmypdf run from the terminal. Seems to work OK, although requires a few extra steps.
-
zuphilipI answered you now on GitHub, where we can continue this issue.