Plugin to fetch missing abstracts?

alexcr87 · February 13, 2023

Abstracts are really useful when you want to parse a lot of items in one go, sadly many items in my database have missing abstracts. Does anyone know if there exists a plugin that could programmatically fetch abstracts when they’re missing from item metadata?

adamsmith · February 13, 2023

Pretty sure there's nothing currently, no.

alexcr87 · February 13, 2023

Yeah I just checked on Crossref, which I know other plugins fetch their data from (e.g., DOI) and abstracts aren’t part of metadata in their database. That would likely mean a plugin would have to fetch the abstract from individual journals—a pretty hard task I guess.

The only other option I see would be to fetch from Google Scholar or Google Books, but other plugins have had trouble not being locked by Google’s anti-abuse mechanisms when processing a large number of items.

adamsmith · February 13, 2023

CrossRef does, increasingly, have abstracts, so in theory that'd be an option, as would querying PubMed where a PMID is present, and OpenAlex, which might have some more data than CrossRef, but no one has done that so far. Wouldn't be prohibitively hard, though, using either the DOI Manager or the PMID fetcher as a template.

alexcr87 · February 13, 2023

Interesting. I just checked and it’s indeed pretty straightforward with a DOI to obtain e.g. the PMID, and export it as structured text (nbib) with the abstract included. That means the plugin would be a simple loop over item abstracts, and if blank curl a request from pubmed—seems pretty simple in theory. I have zero experience developing Zotero plugins (in Python it would take me 10 minutes of work heh) but maybe I’ll give this a try once things settle a bit with work!

alexcr87 · February 13, 2023

Actually I just remembered about pyzotero ; maybe I could try to make a script that does a one-time backfill for the 1500 items in my library whose abstract is missing.

Papers will be probably be easy, but for books I’m not sure. Any idea how I could get book summaries with their ISBN? Not sure if Google Books will make it easy to extract that amount of information.

adamsmith · February 13, 2023

Library of Congress has book summaries in a lot of its MARC data -- you can check out the calls Zotero is already making as part of its ISBN lookup https://github.com/zotero/translators/blob/master/Library of Congress ISBN.js

Agree, if you just want to backfill, using pyzotero is going to work pretty well.

alexcr87 · February 13, 2023

Interestingly, my library seems filled with books that have either,

a) no ISBN, or
b) an ISBN, with "Library of Congress ISBN" in the "Library Catalog" field
c) an ISBN, with "Open WorldCat" in the "Library Catalog" field

I’ve tried searching the LoC for books of category (b) but oddly can’t find them in their database. I’m gonna guess that Zotero would have pulled the abstract already if it was available anyway.

Also odd, tried looking for books of category (c) by ISBN in WorldCat and they *do* seem to have abstracts. Strangely Zotero did not pull those out?

In any case, that does seem like an interesting problem to solve...

lucaswiese · February 16, 2023

If you end up putting a script/plug-in together to gather abstracts by DOI, I would be very interested! Looking to find a way to get abstracts by bulk for my non-abstract items in a big collection.

foxsayswhat · February 28, 2023

From PMID, the Abstract Export option generates good data. It's missing field labels or delimiters. Perhaps there's a way to merge the Citation Manager RIS file with Abstract entries? VLookup?

declantaylor · April 4, 2023

@alexcr87 I'd be curious if you could share your code? I'm trying to source a whole bunch of abstracts (in my case, the 10s of thousands) for a research project, starting with .bib files, of which about half the citations have abstracts.

clarkemoyer · October 26, 2023

I would love this script or plugin as well.

Elnahir · March 27, 2024

Apologies for the necrobump, but would anyone be OK with sending money to a willing/interested dev, who might create such a plugin? I'd love to donate money!

It would be a godsend to fetch the abstracts in folders with hundreds of papers automatically, and not just by hand.

FeralFlora · March 28, 2024

@elnahir, no need, the Linter addon already does this:
https://github.com/northword/zotero-format-metadata

seredes · July 30, 2024

@FeralFlora Hello, and thank you for giving me the opportunity to try a new add-on. I have installed Linter (version 0.44 for Zotero 6), but it does not retrieve the abstract. Does the version for Zotero 7 get it?

seredes · July 30, 2024

@FeralFlora looks like it works now! First I had to make sure that all the citations had a long DOI with the DOI plugin. After that, I rerun Linter and it fetched the abstract.

EDIT I talked too soon... only some abstracts (a minority) are fetched. If anybody has ideas about how to fetch all abstracts, I would appreciate it!

EDIT I can paste the bib file into ChatGPT/Consensus and it is able to get the abstracts and generate an updated bib file that one can copy / paste.

EDIT Consensus is very finicky unfortunately. it works, then it does not work etc. Looks like the quickest method is to get the abstracts by hand :(

Elnahir · July 30, 2024

Gotta say, that last edit broke my heart a little. :/

PaulS42 · October 8, 2024

Any updates on this? I can't get the Linter to work either.

stepap · March 20, 2025

Just tried the latest version of Linter with Zotero 7 and it doesn't fill out abstract.