Feature request: translate hex encoding in doi urls so doi lookup doesn't fail
I love the doi lookup function in Zotero: has saved me hours (and Z has saved me months of my life: thanks!) This is something I hit quite often now when copying a doi from a web page. It pastes into Z as the hex encoded version, e.g.
TIA,
Chris
https://doi.org/10.1177%2F25152459211035109
and that causes Z to, not unreasonably, say there is no such doi. Quite, it is should be https://doi.org/10.1177/25152459211035109
which is what I saw on the screen when I copied it. Would it be possible for Z to "de-hex" pasted magic wand lookups, or to try de-hexing them when at first they fail? TIA,
Chris
Also, you always want to use the Save to Zotero button when you can. If you have a DOI link, you would normally just click it and save from the article page, not paste the DOI URL into Zotero. Add Item by Identifier shouldn't be the main way you get items into Zotero regardless, and certainly not with DOI URLs. So I'm not sure we want to do this.
I find that many of the ways I get sent journal listings only give the DOI in this way so it's not possible to select the DOI as it's hex encoded in the link. I guess that copying those links had seemed not that different from copying the DOIs and had mostly worked for me in recent years.
"Also, you always want to use the Save to Zotero button when you can." Why?! Very often I can only get as far a page about the paper as the paper is behind a paywall. Often I then store the reference and abstract so I can find it easily in Z later should I want to. I think that gives me a much cleaner library than saving the page to Z.
Tangentially: you say "you always want to" and "Add Item by Identifier shouldn't be the main way you get items into Zotero regardless, and certainly not with DOI URLs."
Why? Mostly it works really well for me and until I started to hit this issue with hex encoded URLs it was simply fantastic as I've said. So why deprecate it?
Is there a place that deals out the logic to deprecations and prescriptions like this? I have used Z since a 1.? version but then found the standalone version, as that appeared, more to my liking and the save to Z button isn't something I use a lot, I use it almost entirely for web pages and haven't used it for formal papers with DOIs for years. I do sometimes read the Z update summaries but they are (rightly) about the actual changes, not about changing recommended and denigrated usage. Perhaps I am missing something that does summarise changes in those things?
TIA, Chris
Using the browser button has been the preferred way to get data into Zotero from day one. It gets full text, it's more likely to get abstracts and it allows us to compensate for site specific data issues.
For the Q at hand, the problem really is that the strings in DOIs aren't restricted, so a %is perfectly valid in a DOI. One thing we could consider, though, is that I believe the prefix, i.e. the first part after 10. is more tightly prescribed. If I'm right about that, we could just assume that any DOI wit a % before the first / is already URI escaped
If the scripting power behind the magic wand could recognize the encoded slash or automatically remove the address-part that precedes the actual DOI (and if I was young enough) I'd do cartwheels to express my joy.
*Sometimes, with these websites, copying the article abstract is recognized by the webpage and promotional text for the journal and publisher is appended to the abstract text that I thought I copied. (The promotional stuff is not displayed anywhere on the screen.)
10\.\d+%2F.+
as a) the beginning of a valid DOI and b) an indicator that the DOI is already URI escaped, that'll work nicely. I agree that there are enough cases of URI encoded DOIs in the wild to make this quite useful.@DWL-SDCA: Can you provide an example of what you're describing?
I will encounter again each of the problems I mentioned in my earlier post and will make note of the details. The problem of journals not having a translator and needing to enter by hand + cut and paste is fairly common for obscure East Asian, Eastern European, and central African journals. Quite a few of these journal articles present something they claim is a DOI but I can find no service that will resolve them. Many times the publisher presents these as a PDF with no html version. Zotero will import the PDF file but can not identify metadata. Often these PDFs are protected and I cannot copy and paste from them.
@dstillman If there is a way to select/copy from Zotero items in the trash-bin (without restoring the binned record to my library) I could probably give examples in a few minutes. Even better would be if I could sort my binned articles to find records that are missing a PMID in the Extra field. That would make it easier to find the records of problem items I encountered in the past couple of days. As it is I'll scroll through the binned records sorted by publisher and see if that can trigger my memory.
From memory, this used to be a problem with SAE journals, now MOBILUS, journals but now I see that the Mobilus journal translator works with DOI. [Off topic: Unfortunately, the SAE is slow to apply for and post DOIs so articles such as https://saemobilus.sae.org/content/02-14-03-0023/ without an assigned DOI cannot be imported except as a webpage with essentially no metadata for the article at all.]