HLHJ

About

Username
HLHJ
Joined
Roles
Member

Comments

  • Do publishers generally use CC licenses in a readily machine-readable form? Is it always easy to scrape a CC license? If so, it definitely deserves its own field, and a standard format, uncluttered with non-machine-readable stuff.
  • Sorry for the totally ignorant suggestion, I didn't know Crossref wasn't used to get DOIs. The most common metadata problem I get is searching by ISBN or something and getting a bunch of PMIDs, if I'm remembering correctly. It's a tad annoying …
  • I think we can all agree that in principle, ORCIDs are a great idea. Author's websites often have only selected publications, and sometimes no DOIs. Sometimes the author has no online publications list, and it can take forever to be moderately certa…
    in ORCID Comment by HLHJ July 27, 2014
  • I'd love to be able to update metadata, but I'm not suggesting it here. I'm suggesting lifting the scraping algorithms from OAG and use them to improve Zotero's scraping. For examples of copyright data, in my own collection I have: copyri…
  • This is happening! We are currently making a database of the metadata of everything with a DOI in WikiData (the database sidekick of Wikipedia). It is CC-0. I think this addresses most of the problems discussed here. Zotero does not have to ho…
  • Sounds nice :) . If the PDF is clean and the bibliographic entries have DOIs, stages 1-3 should be possible. Unfortunately automated mass downloads are frowned upon by many publishers; they might block you. So you would probably have to do the …
  • Dan Stillman's idea sounds like a good one to me. It has the advantage that the papers used by Zotero users are more likely to resemble those used by other Zotero users than those used by users of Google Scholar. It also gives an incentive to get ot…
  • It would be nice to be able to choose the sources of your metadata, including the Mendeley database. Could OAI-PMH be used? Mendeley’s database is freely accessible under a Creative Commons license, but is it forkable? Can someone mirror it, l…
  • Curating the metadata would be a huge job. I don't suggest Zotero does it. It would be a really useful job, though, and Zotero would make a great tool for anyone doing it, even without integration. Let's see if we can figure out who might do it, and…
  • Thanks, adamsmith. That looks like it would work on any OCRed PDF. I'll give it a try, and sorry for not spotting it before I posted.