Majority of PDF metadata retrieval is wrong
Hi,
The majority of PDFs have been recognized incorrectly since Zotero switched over to the new service. The mismatches are often hilarious (the matched item doesn't even share a subject with the PDF). I can't figure out what the supposed commonality is (it's not the ISBN/ISSN, for example, and it's not the DOI, where one exists).
Is there some way to configure it to use the Google Scholar resolver again?
In this state, Zotero becomes borderline useless to me as I have to create all "parents" manually.
Is anyone else seeing similar behaviour?
(And also a reminder that even with the new service, you're still almost always better off importing items via the Save to Zotero button rather than using Retrieve Metadata.)
Regarding your point about importing: sure, but the Google Scholar service was far more accurate. I never had any _mis_matches, just the occasional case of "too little" information.
I'll hold off on reporting these issues until/unless they occur again now that I've reinstalled (which I assume has reset my various parameters).
Reinstalling shouldn't really have any effect on the recognizer, so you might try re-adding one of the earlier PDFs that was recognized incorrectly to see if it's still incorrect and report it if so.
Note that "reporting" means right-clicking on the new parent item and selecting "Report Inaccurate Metadata" (which is only available for a limited time after the retrieval).
Re: "borderline useless": the recognizer should do a pretty good job, and we understand that people's workflows differ. Still, adding items and their associated PDFs via the "Save to Zotero" button is the best/recommended way of getting the majority of items into Zotero, so a misbehaving recognizer generally shouldn't be a significant impediment.
The ISBN being frequently on the last pages of ebooks may account for the lack of recognition, but classification as Journal Article is a bug. Any plans to improve on those issues soon?
Thanks for the great work!
The 10% incorrect ones would be useful to report.