retrieve pdf metadata problem
Hi Again,
Strange problem - I watched the retrieve PDF metadata screencast and tried it on a number of PDFs.
The first time I dragged and dropped 3 PDFs I got the option "retrieve metadata from PDF" when I right clicked. I tried it but it didn't work (it opened a window with a small rotating circle which never stopped). I canceled and tried again - still nothing. I then tried it with a few other PDFs I imported and to my surprise I no longer had the option to retrieve PDF metadata at all when I right click.
Any ideas what is going on?
Iddo
I posted a comment a while ago but never got a response from anyone. It has been mentioned in the past but the problem remains. The metadata retrieved is also incorrect and incomplete. The only engine used to retrieve is Google Scholar.
I also anxiously await the ability to drag my PDFs into Z without having them duplicated into the Z files.
Matsumura Y, Nagashima M.
With this article, when dragging the PDF to Z and trying to retrieve the metadata, the author I got was "Organs C.T.", without the journal and pages, but it did get the volume and issue.
In the meantime a window keeps appearing at the top of my screen asking me to enter a URL.
Thanks for your help; I look forward to your suggestions.
I just don't have the option to get the meta-data.
On my desktop I never had the option and on my laptop I had it and now I don't.
Interesting - why is this not installed by default? I would never have guessed I needed to do that.
O.K. now I have the option - I tried two PDF files and it did not find any metadata - can you direct me to a free PDF I can download that you know for certain has retrievable metadata, so I can try and see if it actually works?
Thanks,
Iddo
So apparently many PDFs from JSTOR don't have metadata - what a shame :(
So basically what you are saying is that most PDFs I try to import this way will not have metadata? Is there any conceivable way around it, apart of course from typing all the data myself, which is something I don't really want to do for the hundreds of PDF files I already have?
Just a thought - why not use an OCR application and build an algorithm that can try to extract the title, author, year of publication etc. from the front page of a PDF?
It won't be 100% (probably not even 70%) but it might be better than typing everything by hand.
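To make the idea concrete, here is a rough sketch of the guessing step only, assuming the front-page text has already been obtained somehow (OCR or otherwise). The function name `guess_metadata` and the rules it uses (first line with at least four words as the title, first 19xx/20xx number as the year, a DOI-shaped string if present) are my own assumptions for illustration, not anything Zotero does.

```python
import re

# Illustrative heuristics only; real front pages will defeat these often.
YEAR_RE = re.compile(r"\b(?:19|20)\d{2}\b")
DOI_RE = re.compile(r"\b10\.\d{4,9}/[^\s\"<>]+")


def guess_metadata(first_page_text):
    """Guess title, year and DOI from the plain text of a PDF's first page."""
    lines = [ln.strip() for ln in first_page_text.splitlines() if ln.strip()]
    # Assumption: the title is the first line with at least four words.
    title = next((ln for ln in lines if len(ln.split()) >= 4), None)
    year = YEAR_RE.search(first_page_text)
    doi = DOI_RE.search(first_page_text)
    return {
        "title": title,
        "year": year.group(0) if year else None,
        # Trailing punctuation is usually sentence punctuation, not part of the DOI.
        "doi": doi.group(0).rstrip(".,;") if doi else None,
    }
```

A guess like this would still need to be checked by eye, but a DOI or a title string is at least something that can be pasted into a search.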
I don't know if OCR has been discussed before. It seems too heavy and platform-dependent to be a reasonable dependency to me, but the idea of allowing end users to plug in command line apps for indexing has been discussed. If custom commands could be run on file attachment or for indexing, you could insert your favorite OCR app into the chain before pdftotext.
You talk about "a large pool of data" - is this something you are currently actively looking into or are we talking about a distant future?
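As a sketch of what such a chain could look like outside Zotero: run each command in order and take the first one that produces any text. The helper name `first_text_from_chain`, the `{pdf}` placeholder and the `my-ocr-tool` entry are made up for illustration; only the `pdftotext` call (`-l 1` for the last page to convert, `-` to write to stdout) is a real command line. Here the OCR tool is tried as a fallback after pdftotext, but the order is just the order of the list.

```python
import subprocess


def first_text_from_chain(pdf_path, commands, timeout=60):
    """Run each command in order and return the first non-blank stdout, else None."""
    for template in commands:
        # Substitute the PDF path for the "{pdf}" placeholder argument.
        argv = [pdf_path if arg == "{pdf}" else arg for arg in template]
        try:
            done = subprocess.run(
                argv, capture_output=True, text=True, timeout=timeout
            )
        except (OSError, subprocess.TimeoutExpired):
            continue  # tool not installed or hung; try the next command
        if done.returncode == 0 and done.stdout.strip():
            return done.stdout
    return None


# Example chain: the PDF's text layer first, then a user-supplied OCR command.
CHAIN = [
    ["pdftotext", "-l", "1", "{pdf}", "-"],  # first page only, written to stdout
    ["my-ocr-tool", "{pdf}"],  # hypothetical placeholder, not a real program
]
```

Calling `first_text_from_chain("paper.pdf", CHAIN)` would then give whatever text the first working tool managed to extract.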
Engin
One of the options that I would like to see is the ability to pick your repository. I agree that PubMed is great and I have actually been happy with all of the records I have taken directly from it. How do you save your PDFs? I have been saving them all in YEP with tags. This allows me to pull up all PDFs related to a subject and view them at one glance. Although having PDFs associated with a citation in Z is functional, it does not have the visual feature of YEP and ends up duplicating the file on your drive.
Rashid
I do not save my PDFs any specific way any more, now that I'm using Zotero. But I see your point.
Simon,
It does make me wonder, however, how other software manages accuracy in metadata retrieval (I am specifically thinking of "Papers", which is available only on Macs). I guess their algorithm is different, and if so, what are the chances of improving the current Zotero algorithm?
"I just don't have the option to get the meta-data." when right clicking on a PDF in the Zotero library.
I followed Simon's instructions:
"Iddo, you need to install the PDF indexer for this feature to work. Go to "Search" in the Zotero preferences and click the "Check for installer" button. This should enable the option for you."
I confirmed that I do have PDF indexing installed (version 3.02), but I wonder if that is the problem? I'm using Zotero 1.07, which the updater says is the most recent version (for Apple OS X.5), but Iddo stated 1.5 syncing with 3.2.
So this new feature only works on the Zotero 1.5 sync preview?
Any hints?
I just tried this PDF again and it worked quite nicely, so it looks like this is not a general issue but something specific to your configuration.
Thanks,
Gary
I tried another article that is definitely found in Google Scholar, and it also did not find metadata.
I am using NitroPDF, not Adobe Reader - any connection? The PDF document properties do show the right title and author...
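For anyone who wants to check the embedded properties independently of the PDF viewer, the `pdfinfo` command line tool prints them as `Key: value` lines. Below is a small sketch that turns that output into a dictionary; the function name `parse_pdfinfo` is my own. It only shows what is stored in the file and says nothing about what the retrieve feature actually looks at.

```python
def parse_pdfinfo(output):
    """Turn the "Key: value" lines printed by pdfinfo into a dictionary."""
    fields = {}
    for line in output.splitlines():
        # Split on the first colon only, since values such as dates contain colons.
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields
```

The input would come from something like `subprocess.run(["pdfinfo", "paper.pdf"], capture_output=True, text=True).stdout`, after which `fields.get("Title")` and `fields.get("Author")` can be compared with what the viewer shows.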