Can zotero search notes added inside pdf?

Hi,

I'm thinking of adding notes directly inside PDFs. However, when Zotero runs a full-text search through PDFs, will it search those notes as well? If it doesn't do it by default, is there a way it can be made to do it? Thanks!
  • you can use Zotfile to extract notes from PDFs (and then search notes).
    http://www.jlegewie.com/zotfile.html
    I don't believe pdftotext, the tool Zotero uses for pdf indexing, reads annotations, so no to the indexing question (though I'm only 80% sure - you'd have to test this yourself).
  • Grand, thank you!
  • Can Zotero use the iFilter tool from Acrobat for searching within PDFs?
    http://www.adobe.com/support/downloads/detail.jsp?ftpID=5542
  • no. It's Windows only and released under a non-free license.
  • edited January 26, 2015
    The Extract Annotations feature of ZotFile seems to be difficult to use efficiently. It gathers comments and other annotations on the PDF into a Zotero Note attachment, but it doesn't maintain this well. So if you later add a different comment to the PDF, this won't show up as a Zotero Note attachment unless you run the extraction process again...in which case you now have duplicates of all the old comments.

    It seems like PDF comments should be the *easiest* thing for pdftotext to extract from a PDF file. (See agreement from noksagt here: https://forums.zotero.org/discussion/7386/search-comments-in-pdfs/ ). If Zotfile can do this, presumably it's possible for Zotero to do it to, although this might require Zotero to run pdf2text *and* pdf.js, which could be cumbersome. I'd very much like to see this solved if possible.

    See also development on "Leela": https://bbs.archlinux.org/viewtopic.php?id=142309
  • the hope is to eventually switch to pdf.js, which does read notes (it's what ZotFile uses, albeit in a modified version). I wouldn't get my hopes up on this being included in pdftotext, regardless of how easy that'd be, though you can certainly ask them.

This is an old discussion that has not been active in a long time. Before commenting here, you should strongly consider starting a new discussion instead. If you think the content of this discussion is still relevant, you can link to it from your new discussion.

Sign In or Register to comment.