Indexing not working: D637792052
Hello, all. I have Zotero working on several machines. I can't get indexing to work on any, neither the full index nor the option for unindexed only.
I am able to index individual PDFs by clicking Indexed: Partial function.
I have pdftotext and pdfinfo 3.02a installed.
Thoughts?
Thanks!
I am able to index individual PDFs by clicking Indexed: Partial function.
I have pdftotext and pdfinfo 3.02a installed.
Thoughts?
Thanks!
https://1drv.ms/i/s!ApTaaiY2zZRmjgDoRS6sDueK2rhU
Let me know if the screenshot is not sufficient.
Thanks!
And what version of Zotero are you running?
@redcloud111, should upgrade to Zotero 5.0.
Thanks!
I did removed the old RTF/ODF scan addons and installed the new beta that works with 5.0 but still getting the error and not able to sync: 1108440472
I fear now I have two problems: 1) the original index issue and 2) this sync issues on my MBP
But I still have the indexing problem at 1312258684 and 1476699390
Any ideas on the sync issue, which I reported from my two mac machines: 1312258684 and 1476699390, or does this corruption issue affect indexing as well?
Thanks!
D523954063
and the report from my primary iMac:
D1917122520
I have deleted and reinstalled the pdftotext and pdfinfo on three machines, and still my library has these errant partially installed PDFs.
Can someone advise steps?
First, to clarify, "Partial" just refers to files that are bigger than your indexing settings allow, and the buttons in the preferences follow those same settings. If you reindex an item individually, that triggers a full reindex. Otherwise you'd have to increase the max pages/characters settings and rebuild the index, but we don't particularly recommend that, since it will slow down searching.
Other things:
For the PC, if you're still having trouble there, you can try the 5.0 Beta, which should fix some problems with background full-text indexing that were showing up in your debug output. If that doesn't help, provide another Debug ID for an index attempt from that.
For the iMac, you're getting this: This is what pdftotext returns when there's a permissions problem with the PDF. PDFs can disallow text extraction, though the degree to which different tools (and versions of tools) obey that varies. But that would explain unindexed files.
I have increased the size of the pages and characters to see if this works. So far, so good.
I will update if I have any questions. Thanks so much for your help!
764362703
1745223485
D556429149
Thanks!
That debug output shows ongoing full-text indexing. There are still some earlier errors logged, but no problems are showing up in the debug output itself (other than a couple PDFs that don't allow text extraction).
Also, have you increased the max pages/characters settings, and if so to what?
I am sorry if I am misunderstanding how this works, but I assumed there is a problem because I still see 740 partially indexed items. If I do them manually, they index. So, I was hoping the tool would automate this for me. All would be well in my mind when I saw the partially indexed number at zero. I assumed the non-indexed items would be non-OCRed pdfs and the such. Is this not the case?
I increased the numbers to 750000 characters and 300 pages. Should I go higher?
Also, I saw the last cancellation error when I debugged and ran a partial index.