automating mass-import from PDFs
This is an old discussion that has not been active in a long time. Before commenting here, you should strongly consider starting a new discussion instead. If you think the content of this discussion is still relevant, you can link to it from your new discussion.
If you're going to use the dropbox/symlink method as you suggest you might in the other thread, you _do_ keep your pdfs in Zotero storage - the symlink is then used to mirror the storage folder to dropbox.
This is my favorite way to keep my .pdf library--it can be read anywhere, including off-line, and although the structure is complex, it more or less makes sense to me and I can usually find old papers this way even when I can't remember author names or other useful details.
I just don't want to have to drag&drop each file. Is there a way to attempt this from the command line? I'm running Mac OS X.
The easiest would be to create a virtual folder with all your PDFs on one level and then drag those to Zotero in a couple of batches.
If you don't know how to do this on a Mac it should be easy enough to google.
automated import of a batch of pdf's then association with high quality metadata would be a great feature to attract many new zotero converts as it builds upon existing efforts (perhaps not ideal) to organize a set of pdf's.
After dragging and dropping a few pdf's, the "Retrieve MetaData for PDF" menu function seems to work reasonably well at getting the correct data to ultimately generate a citation. However, it would need to work very well to encourage any seasoned researcher with thousands of manually curated .bib entries to make the switch.
I wonder has anyone implemented a way to batch import a complete directory structure, then associate with each pdf one or more tags according to the names of the folders (or subfolders) which contained the orginal pdf? It is a common legacy issue that researchers have stored their data in such a hierarchical tree. Especially for interdisciplinary research it is impossible to maintain a single hierarchy as a given paper could easily be placed within two hierarchies. I understand that tags are a way to overcome this issue but do I really have to manually add all these tags when they are implicit in the existing file structure where the pdf's are stored?
Regards,
Ronan
I want the meta data because I have some pdf files with names are unrelated to its content.
Thank you very much!
A major reason for not doing this automatically is the google lock-out described in this thread.
edit: for further discussion of this, please do start a new thread as Dan asked you to.