OCRing a PDF after annotating in Zotero

edited April 4, 2023
I'm working with a lot of large PDFs made up of scanned handwritten documents. I am going to import these into Transkribus, run the OCR, then re-export the PDFs and import them into Zotero. However, some of these PDFs I have already added into my Zotero and annotated prior to OCRing. Would I be able to take the new, OCRed PDFs (which are identical to the old files except for having the text layer underneath) and simply replace the old PDF file (I have stored them in Zotero as links) and have the annotations be preserved? I guess the actual question is, are the annotations in the Zotero database keyed to any particular version of a PDF file?

(If I'm unable to do this, I will have to resort to the other alternative, which I've already done successfully with other documents but is very time-consuming: export the PDF from Zotero with PDF comments, extract the comments via Foxit PhantomPDF or similar, reimport the extracted comments (which are saved as a text file) into the new PDF also with PhantomPDF, and then reimport *that* new PDF back into Zotero and extract the comments as annotations.)
Sign In or Register to comment.