Metadata retrieval from JSTOR doesn’t work with older format URLs
I have an issue with retrieving metadata from certain JSTOR items. It seems to be the same issue as raised here, but I have a bit more information.
The problem seems to be that JSTOR has two formats for its 'stable' URLs. For example:
https://www.jstor.org/stable/43154584
https://www.jstor.org/stable/10.5325/philrhet.47.3.0227
Zotero can import the latter but not the former, presumably because in the latter case it has a DOI.
N.B. I am primarily using the Node.js translation-server (although I also tested the desktop app), like this:
```
curl -d 'https://www.jstor.org/stable/43154584' -H 'Content-Type: text/plain' http://127.0.0.1:1969/web
-> Internal Server Error
```
Is the former, shorter 'stable' format simply not supported? That would be a shame, as the vast majority of JSTOR's content, historically, has this format. It looks like the citation information could, in principle, be scraped from the page, as it can be retrieved manually by clicking the 'cite' button. However, I don't know how difficult JSTOR make this.
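For anyone who wants to script the same check, here is a rough Python equivalent of the curl call above. It is only a sketch: the endpoint is the one from my command, and `build_request` and `translate` are names I made up for illustration, not part of translation-server.

```python
import urllib.error
import urllib.request

# Address taken from the curl command above; adjust if your server runs elsewhere.
SERVER_WEB_ENDPOINT = "http://127.0.0.1:1969/web"


def build_request(page_url, endpoint=SERVER_WEB_ENDPOINT):
    """Build the same plain-text POST that the curl command sends (hypothetical helper)."""
    return urllib.request.Request(
        endpoint,
        data=page_url.encode("utf-8"),
        headers={"Content-Type": "text/plain"},
        method="POST",
    )


def translate(page_url, endpoint=SERVER_WEB_ENDPOINT):
    """Send the request and return (status, body), including for error responses."""
    try:
        with urllib.request.urlopen(build_request(page_url, endpoint)) as resp:
            return resp.status, resp.read().decode("utf-8", errors="replace")
    except urllib.error.HTTPError as err:
        # urllib raises on 4xx/5xx; keep the status and body so they can be inspected.
        return err.code, err.read().decode("utf-8", errors="replace")
```

Calling `translate()` on each of the two example URLs, with the server running locally, should make it easy to compare the status codes side by side. A connection failure (server not running) is deliberately not caught here.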
- URLs of the format https://www.jstor.org/stable/43154584 are absolutely supported using Zotero and the browser connector, which is how 99% of users use Zotero, so 'not supported' is a misnomer.
- JSTOR blocks what it perceives as automated scraping. That's clearly true for ZBib and for translation server via the web library, but it appears to also affect your Node.js server on localhost. Not sure why, but that's a question for them.
- Zotero's translation server will query DOIs before URLs, which is why importing https://www.jstor.org/stable/10.5325/philrhet.47.3.0227 works: you can actually tell that the data are coming from CrossRef, not JSTOR, by checking the Library Catalog field after import.
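
To illustrate the last point, here is a minimal Python sketch of how the two URL formats differ: one carries a DOI in its path and the other does not. This is not translation-server's actual detection code; `doi_from_stable_url` and the deliberately loose `DOI_PATTERN` are assumptions made up for the example.

```python
import re
from urllib.parse import urlparse

# Loose DOI shape (an assumption for this sketch): "10.", a 4-9 digit
# registrant code, "/", then a non-empty suffix without whitespace.
DOI_PATTERN = re.compile(r"10\.\d{4,9}/\S+")


def doi_from_stable_url(url):
    """Return the DOI embedded in a JSTOR 'stable' URL, or None if there is none."""
    path = urlparse(url).path
    prefix = "/stable/"
    if not path.startswith(prefix):
        return None
    # Everything after /stable/ must look like a DOI as a whole.
    match = DOI_PATTERN.fullmatch(path[len(prefix):])
    return match.group(0) if match else None
```

With the two URLs from the original post, the longer one yields `10.5325/philrhet.47.3.0227`, which can be looked up by DOI without touching JSTOR, while the shorter one yields `None`, leaving only the JSTOR page itself as a source.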