Creating a searchable repository of local journal articles?
I have a rather large locally stored library of articles from academic journals, magazines, etc. Each of these articles is in the form of a PDF file. I am having trouble getting Zotero to work for me in this scenario and I'm thinking that perhaps this just wasn't one of its intended use cases. For starters, the only way I can even find to add a locally stored article to my library was by clicking the Add Link to Local File button in Zotero's main window. But when I do this I don't get the same kind of metadata window on the right, all I can do is enter a few notes, tags, and related articles. Furthermore, right clicking the file and selecting Build Bibliography or whatever the option is called tells me it does not contain a bibliography or any references, even though I can open the document and near the bottom I can see that it clearly does. My impression is that it was able to scan this document and parse a bibliography since academic articles almost always use a consistent format for storing references. Am I missing something? Or is this possible?
Thanks
Thanks
For Zotero 2.0b:
Drag&Drop a bunch of your pdfs to Zotero. Select them. Right-click and select "Retrieve Metada for Pdfs" - this will a) store the pdfs in Zotero and full-text index them and
b) (where Zotero finds Metadata) attach them to a bibliographical item from which you can then create a bibliography.
Note the way that Zotero works with files: You can _never_ create a bibliography from a file - it is always the "Metadata" - usually the item to which a file is attached - which is used for creating a bibliography.
If you're on Zotero 1.0 things are a bit more difficult and you don't have the "retrieve metadata" function at all.
As a final word: The retrieve metadata is still in beta: It won't always work, and it won't always be right.
For the next bit you would need to be running Zotero 2.0 beta (in my opinion it is pretty much as stable as 1.0 if you don't use the synchronisation features, and quite stable enough for everyday work even if you do). In 2.0 beta select all the PDFs you have just imported and select "retrieve metadata for PDFs". This searches Google scholar for information on the PDFs and automatically associates them with full bibliographic metadata. Note that you would probably want to do this at a rate of not more than a few hundred PDFs each day - otherwise you will fall foul of a security policy in Google scholar which prevents excess automated requests for information and you will be blocked from Google scholar for a few hours.
If you already have your PDF library organised in a folder structure your best bet is to replicate this using collections (left hand pane) and import into these collections a folder at a time (files will import into the currently selected collection).
Once you are happy that everything is safely in Zotero, and that Zotero is the right tool for you you should be able to safely delete the PDFs in their original location if you want to (note that keeping a regular backup of your Zotero data directory is always a good idea)
This feature "create bibliography from selected item" doesn't do this (although I agree the error message does somewhat imply that it does). Instead it allows you to select a number of items and make a bibliography of those items which you can then paste into a document or email etc. (when actually writing an article/paper/book though you would be better advised to use the word processor plugins, "create bibliography" is a rather quick and dirty method). You are presumably using this tool on your links you created to external PDFs, which will fail as there is no metadata associated with these standalone links with which to build a bibliography.
I don't believe that you can parse the bibliography from a PDF itself with Zotero, and I am having difficulty seeing why you would want to do this, at least in the biological sciences (my field) this would just result in a huge list of articles, most of which would have only marginal relevance to the topic of interest. To store specific references of interest from an article you are reading you can find these online (using google scholar, web of knowledge, scopus, the publisher's website etc. etc.) and grab them using the "page" or "folder" shaped button whcih should appear in the Firefox address bar when you are looking at the article page or a search results page. Alternatively you can copy and paste the DOI, ISBN or PMID number into the "add item by identifier" box (magic wand button) and Zotero will attempt to find the appropriate article/book.
a) How is it generating the metadata? I don't have any PDF metadata stored in the file but it's able to figure it out somehow.
b) Sometimes it figures out the article metadata, but it is completely and totally wrong and has nothing to do with the actual article. Is this just a known bug that I have to deal with?
c) Sometimes it just fails on certain files and says "No matching references found." How can I manually do what it's doing and create this top level document node with a link to the PDF under it?
1) Find the paper/article online and click on the icon which appears in the address bar to automatically save metadata, then drag the PDF onto this item (if a copy of the PDF isn't automatically saved - for this to happen you may need to activate the appropriate option in the preferences).
2) click the "+" icon, choose the appropriate reference type, and drag the PDF to the new blank item, then fill in the metadata fields by hand.
There has been talk of option 2 happening automatically if metadata retrieval fails.
If an article is correctly guessed by Zotero, but it was not able to determine the DOI it fills out some of the information, but not necessarily as much as if it had the DOI. Is there a way for me to manually enter the DOI and have it reload the item? I can go enter the DOI directly into the metadata, but it doesn't appear to trigger it to fetch more metadata. I can always just create a new entry and drag the PDF from the old one to the new one, so this isn't that major.
Also, is there a way to have it import the contents of an entire journal and just create Journal Article entries for each one which I would manually go through and attach to individual files by drag/drop?
That would work if the journal is online with one of the common publishers or JSTOR, EBSCO etc. - you would then go to the issue of the journal and click the folder in the address bar of Zotero and select all - that should do.
As for the first question: You can't reload, no - I take it you have found the magic wand where you can add an item by identifier (including the DOI).
ACM Transactions on Graphics, Volume 28, Issue 3
Is there a way to import all the articles from this journal into Zotero so that I get one Zotero item for each article, to which I can drag the corresponding PDF?
Apparently there is a way to "save an inssue to a binder" when you have an ACM account -
If you could export that to Endnote (and have the automatically import RIS checked in the Zotero General Preferences) that _might_ do it for you. But let me emphasize _might_ as I haven't tried this out.