Why are some non-image PDF files not indexed?
1. Why are some non-image PDF files not indexed?
I was surprised to find some of the non-image PDF files not indexed. (My Zotero has up-to-date PDFTOTEXT and PDFTOINFO installed.)
2. Does it matter whether the PDF was created with embedded fonts or was originally an image file later OCRed, when Zotero decides whether to index or not?
3. Is there a batch function to index all non-image PDF files that have not been indexed for one reason or another? Or do you have to go over each PDF file, check their index status and index them manually?
I was surprised to find some of the non-image PDF files not indexed. (My Zotero has up-to-date PDFTOTEXT and PDFTOINFO installed.)
2. Does it matter whether the PDF was created with embedded fonts or was originally an image file later OCRed, when Zotero decides whether to index or not?
3. Is there a batch function to index all non-image PDF files that have not been indexed for one reason or another? Or do you have to go over each PDF file, check their index status and index them manually?
This is an old discussion that has not been active in a long time. Before commenting here, you should strongly consider starting a new discussion instead. If you think the content of this discussion is still relevant, you can link to it from your new discussion.
3. You can re-index all files (under search in the preferences), but there is no option to just re-index unindexed files, no. That said, if you have indexing turned on, in principle the case of a file that doesn't index automatically, but does index manually shouldn't exist.