Duplicated Snapshots
I'm relatively new to Zotero. Sometimes when I save information from arXiv or conference websites using the Zotero browser plugin, two identical webpage snapshots are created under the paper item. How can I avoid the duplicates?
My environment is Windows 10 Pro (Build 19042), Chrome 87.0.4280.88.
Examples of problematic webpages:
https://arxiv.org/abs/2005.14165
https://papers.nips.cc/paper/2020/hash/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html
Sometimes I get messages like "An error occurred saving with arXiv.org. Attempting to save using Embedded Metadata instead." while saving paper information, I'm not sure whether it is related to this issue.
My environment is Windows 10 Pro (Build 19042), Chrome 87.0.4280.88.
Examples of problematic webpages:
https://arxiv.org/abs/2005.14165
https://papers.nips.cc/paper/2020/hash/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html
Sometimes I get messages like "An error occurred saving with arXiv.org. Attempting to save using Embedded Metadata instead." while saving paper information, I'm not sure whether it is related to this issue.
If you're seeing this in 5.0.94, let us know (and let us know specifically whether it happens without getting an error in the save popup).
The error messages from the browser plugin options are shown the following. I can open the first arXiv metadata link in the browser without any visible delay (so I have no idea why the HTTP request fails). The second link is indeed unaccessible (since the information has not been made available for the pre-proceeding).
[JavaScript Error: "HTTP GET http://export.arxiv.org/oai2?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:arXiv.org:2005.14165 failed with status code 0"]
[JavaScript Error: "HTTP GET https://papers.nips.cc/paper/2020/hash/1457c0d6bfcb4967418bfb8ac142f64a-Abstract.html/bibtex failed with status code 500"]
I'm able to save from both of those without getting two snapshots. The first completes without an error for me — arXiv or some proxy server may be blocking the export.arxiv.org request for you. The second falls back to Embedded Metadata due to the server error, but I don't get a duplicate snapshot.
BTW, sometimes I do use a proxy server (not in the two runs I just did) but I don't understand how the request could fail if the link can be opened in the browser (using the same proxy).
I have disabled all other extensions but the error remains.
arXiv Link Debug IDs
Zotero: D64931766
Zotero Connector: D278768867
NIPS Link Debug IDs
Zotero: D1040259412
Zotero Connector: D670808515
Here is a debug ID from the connector: D2046828332 (with my only other firefox addon [singlefile] disabled)
Unlike arxiv.org, there does not appear to be a dedicated translator for biorxiv, and the Connector is saving the item using Embedded Metadata. Here is the kicker, I don't get duplicated snapshots on arxiv.org.
Additionally, all saved abstracts also start and end with the html tags for paragraph. Probably coming from the meta tag for citation_abstract. Kinda annoying but harmless.
An Example URL: https://www.biorxiv.org/content/10.1101/2020.05.28.122242v1
I am working on a fix and will update here when I have more information.