automating mass-import from PDFs
This is an old discussion that has not been active in a long time. Before commenting here, you should strongly consider starting a new discussion instead. If you think the content of this discussion is still relevant, you can link to it from your new discussion.
Upgrade Storage
If you're going to use the dropbox/symlink method as you suggest you might in the other thread, you _do_ keep your pdfs in Zotero storage - the symlink is then used to mirror the storage folder to dropbox.
This is my favorite way to keep my .pdf library--it can be read anywhere, including off-line, and although the structure is complex, it more or less makes sense to me and I can usually find old papers this way even when I can't remember author names or other useful details.
I just don't want to have to drag&drop each file. Is there a way to attempt this from the command line? I'm running Mac OS X.
The easiest would be to create a virtual folder with all your PDFs on one level and then drag those to Zotero in a couple of batches.
If you don't know how to do this on a Mac it should be easy enough to google.
automated import of a batch of pdf's then association with high quality metadata would be a great feature to attract many new zotero converts as it builds upon existing efforts (perhaps not ideal) to organize a set of pdf's.
After dragging and dropping a few pdf's, the "Retrieve MetaData for PDF" menu function seems to work reasonably well at getting the correct data to ultimately generate a citation. However, it would need to work very well to encourage any seasoned researcher with thousands of manually curated .bib entries to make the switch.
I wonder has anyone implemented a way to batch import a complete directory structure, then associate with each pdf one or more tags according to the names of the folders (or subfolders) which contained the orginal pdf? It is a common legacy issue that researchers have stored their data in such a hierarchical tree. Especially for interdisciplinary research it is impossible to maintain a single hierarchy as a given paper could easily be placed within two hierarchies. I understand that tags are a way to overcome this issue but do I really have to manually add all these tags when they are implicit in the existing file structure where the pdf's are stored?
Regards,
Ronan
I want the meta data because I have some pdf files with names are unrelated to its content.
Thank you very much!
A major reason for not doing this automatically is the google lock-out described in this thread.
edit: for further discussion of this, please do start a new thread as Dan asked you to.