Metadata retrieval from JSTOR doesn’t work with older format URLs

philipconway · February 15, 2023

I have an issue with retrieving metadata from certain JSTOR items. It seems to be the same issue as raised here, but I have a bit more information.

The problem seems to be that JSTOR has two formats for its 'stable' URLs. For example:

https://www.jstor.org/stable/43154584
https://www.jstor.org/stable/10.5325/philrhet.47.3.0227

Zotero can import the latter but not the former, presumably because in the latter case it has a DOI.

N.B. I am primarily using the Node.js translation-server (although I also tested the desktop app). Like this:

```
curl -d 'https://www.jstor.org/stable/43154584' -H 'Content-Type: text/plain' http://127.0.0.1:1969/web
-> Internal Server Error
```

Is the the former, shorter 'stable' format simply not supported? This is a shame as the vast majority of JSTOR's content, historically, has this format. It looks like the citation information could, in principle, be scraped from the page, as it can be retrieved manually by clicking the 'cite' button. However, I don't know how difficult JSTOR make this.

adamsmith · February 15, 2023

The explanation as given in the other thread is exactly what's going on, and I don't really see what Zotero can do about that:

- URLs of the format https://www.jstor.org/stable/43154584 are absolutely supported using Zotero and the browser connector, which is how 99% of users use Zotero, so 'not supported' is a misnomer.
- JSTOR blocks what it perceives as automated scraping. That's clearly true for ZBib and translation server via the web library, but appears to also affect your node.js server on localhost. Not sure why, but that's a question for them.
- Zotero's translation server will query DOIs before URLs, which is why importing https://www.jstor.org/stable/10.5325/philrhet.47.3.0227 works: you can actually tell that the data are coming from CrossRef, not JSTOR, by checking the Library Catalog field after import

dstillman · February 15, 2023

I took another look at this and was able to fix it for now. Update here: https://forums.zotero.org/discussion/comment/428223/#Comment_428223

philipconway · February 15, 2023

Thanks for the responses. I understand that this is really a JSTOR problem (and that my use case is... fringe!).

dstillman · February 15, 2023

But I said I fixed it…

philipconway · February 16, 2023

Right, I can confirm that the translation-server is now working as expected. Thank you very much :)