Fulltext search in Web Library yields no additional hits
In my web library there is no difference in number of hits when switching from "Title-Creator-Year" to "Title-Creator-Year + Fulltext Content".
(search for "reliability": 17 hits, same as in the desktop client, while the latter gives 74 hits with "all fields + tags" (of which 6 are tagged), and 473 hits with "everything")
My setup is zotero 6.0.18 with data syncing (fulltext syncing is checked and was successful), but without file syncing (I'm using linked attachments).
Is this an expected limitation of my setup that I have missed or a bug?
Edit:
Did some experiments, and it looks as though this might be a reindexing problem: The fulltext search is ok for a newly imported subcollection, and when (in the original subcollection) I manually reindex one of the "missing" items, it yields a hit in the web search.
Anyone an idea for the reasons, so I can avoid those problems in future?
(search for "reliability": 17 hits, same as in the desktop client, while the latter gives 74 hits with "all fields + tags" (of which 6 are tagged), and 473 hits with "everything")
My setup is zotero 6.0.18 with data syncing (fulltext syncing is checked and was successful), but without file syncing (I'm using linked attachments).
Is this an expected limitation of my setup that I have missed or a bug?
Edit:
Did some experiments, and it looks as though this might be a reindexing problem: The fulltext search is ok for a newly imported subcollection, and when (in the original subcollection) I manually reindex one of the "missing" items, it yields a hit in the web search.
Anyone an idea for the reasons, so I can avoid those problems in future?
Possibly related: There is one article found by searching "mikrogl" but not "mikrog" or "mikro" (whereas the desktop clients has no problems finding it by any of the three phrases).
Looks as though the web search uses the fulltext index not the same way as the desktop client does.
Thanks for reporting.
(Out of curiosity: This means the index from linked attachments (from the desktop client) is not used as is (after sync to the zotero server), but has to be processed/re-indexed itself prior to being used by the web search?)
(I wonder if the problems might be related to https://forums.zotero.org/discussion/101222/partial-library-sync-between-devices#latest or https://forums.zotero.org/discussion/101049/different-search-results-on-different-computers#latest - perhaps this help to track the error down?)
The backlog is still processing, but it looks like all but two of your PDFs have been indexed. (Those two were uploaded a couple days earlier, and we haven't processed them yet.)
Note that you should test with full words. It's a different kind of search engine, and it won't necessarily match prefixes in the same way as the desktop app, though we may be able to improve that.
Sorry - I'm still a bit confused by your wording "PDFs have been indexed", so for clarity's sake: We are not speaking about the PDF itself that is getting indexed on the server side (since I'm using linked attachments without file syncing), but the transferred index (belonging to a PDF, or to a note) that has yet to be processed, right?
(There are also notes that are not found in the web search.)
To avoid spamming you with unnecessary updates of mine: could you give me a hint when at the earliest I should text again in case the problem (with search for full words) should persist?
(And yes, I'm using "PDFs" loosely. The desktop app uploads the raw extracted text and that gets indexed, but the text is associated with a given PDF attachment.)
(If necessary, I can send sample PDFs via email to support@zotero.org?)
The corresponding browser URL ends with "...search/reliability/everything"
The item keys for the 2 example parent items are HH8HC3YR and HVT4VLUK
But for now I've increased that to 300 items, and we can see how that performs. That will cover all results for you for this search.
(The current system isn't ideal in that it does the full-text search independently of other fields, so other search terms don't affect the full-text limit, but that's just how this works at the moment.)
Thank you very much for checking and clarifying (and increasing the limit)!
My results for the desktop app and the web library are very similar now (269 vs. 270 items) - the remaining differences probably caused by the different behaviour of the web version you mentioned, not being able to search for prefixes (e.g. search for "achievement" -> 85 items, "achievements" -> 24 items; both merged manually -> 99; desktop "achievement"- > 98 items).
One final suggestion: As long as the web library has limited search capabilities, how about listing those restrictions on https://www.zotero.org/support/searching to avoid confusion about different search results?