ZotFile - Advanced PDF management for Zotero
This is an old discussion that has not been active in a long time. Before commenting here, you should strongly consider starting a new discussion instead. If you think the content of this discussion is still relevant, you can link to it from your new discussion.
This discussion has been closed.
But their is another question/problem i have trouble with..
I use a lot of pdfs i have to ocr first, before i can work with them (dpi300x300 scans / normal modern letters) Zotfile seems to have a hard time extracting correct annotations from the pdfs, especially on the line break the addon is prod. a lot fails.
Is it a problem of bad orc software (I use finereader) or a problem of the zotfile code? Will it be any better in the future?
best regards, E.
The ocr was made with adobe finereader professional 12 (standard settings "picture" into "pdf" /> the annotations with the drawboard.pro app on a surface-book.
I would be very pleased to get your help and advice on how to get the annotations paragraphs extraced correctly.
best regards, E.
Zotfile: "Obwohl angesichts all dieser Turbulenzen die Bedrohung der Men Zeitgenossen anders empfunden haben schen durch klassische Unsicherheitsfaktoren wie Kriminalität im Rück blick eher marginal erscheinen mag, ist zu konstatieren, daß dies viele"
Preview copy & paste "Obwohl angesichts all dieser Turbulenzen die Bedrohung der Men schen durch klassische Unsicherheitsfaktoren wie Kriminalität im Rück blick eher marginal erscheinen mag, ist zu konstatieren, daß dies viele Zeitgenossen anders empfunden haben"
Edit: Here is a ticket for it: https://github.com/jlegewie/zotfile/issues/221
thanks for your time and the support ticket you opened at github.
It would be a milestone for the workflow for me and a lot of people i know working in the humanities here over in europe (were professional edited and especially ocr'ed papers are still rare..)
You know any other method to get the annotations out of the pdf and then back into my zotero? (excluded "copy+Paste")
best regards from Germany,
E.
https://www.youtube.com/watch?v=4aDvAPLZwCY
it can be very interesting for you :) (exactly video time 7:25 and next)
I have tried Zotfile to extract the highlights for a pdf in foxit reader and it worked as advertised.
The problem is it doesn't work for this pdf here, it is in ascii and I'm using the same foxit's built in annotation maker. What is the problem?
Link to the pdf
The article is free to download. Just click on the blue download link to the right of the webpage.
Thank you in advance
I'm struggling with the same problem zurpher and mbruffey described early this year:
-----------------------------------------------------------------------------
"zurpher Jan 22nd 2016
I have scanned a text and OCR'ed with Adobe Acrobat Pro DC v2015.006.30033 I use Firefox (43.0.4), ZotFile (4.1.6), Zotero Standalone (4.0.28.7), Windows 7 When trying to extract my annotations, I get a notice “Zotfile: Extracting Annotations…” but then the circle stop after about a quarter, the notice disappears and not extractions are imported into a Zotero note. I can copy and paste from that document so it should work. I had a similar issue previously that Joscha was able to fix. Any ideas how to fix this one? I have posted the document online in the zotfile Group Library."
-----------------------------------------------------------------------------
mbruffey Feb 4th 2016
Cannot Extract Annotations
It has been a long while (at least a couple of years) since I used the Zotfile Extraction feature. I can't extract notes (from a PDFXChange'd file). I tried on several items tonight, in both Juris-M and plain old Zotero. Extraction seems to begin properly, but the circle never completes its compass, halting about one o'clock, at which time the popup window disappears. I'm on Ubuntu 14.04 with its latest Firefox and the latest Zotfile. For good measure, I attempted the operations in a brand new profile with only Zotero and Zotfile as addons. Thanks, M
-----------------------------------------------------------------------------
How can we fix this? In my case, I was able to extract annotations from a file. But I've continued put annotations in it and now I cannot extract them. And I should add that I've been using a lot of colors. Any chance of those things have ruined everything?
Thank's!
Thiago
How do it *remove* PDFs from my tablet once i've finished reading them? There is no "Remove from table" option in the context menu -> "manage attachments" section.
I suppose I can remove the _tablet tag from the item and delete it from the tablet sync folder, although my attempts to do this so far result in zotfile reporting "missing files". Is there a better way?
@thiagoafdoria, are you using version 4.2.6?
@livingthingdan, "Manage Attachments -> Get from tablet". Don't remove the _tablet tag manually. That screws things up (as described in the documentation).
I'm using Zotfile to catalog a library of pdfs. Today I ran into this problem. I'm storing a working paper I wrote myself. It has three parts: the original paper, a set of illustrations (graphs), and a set of statistical tables. Zotfile's naming mechanism changes their names to "metadata", "metadata_2", and "metadata_3". Years from now, I will never know what the differences between the files are. Instead, I'd like names like "metadata_text", "metadata_graphs", and "metadata_tables".
I realize Zotfile has the hidden option .disable_renaming. This would allow me to customize the names as desired. But changing and resetting hidden options is cumbersome for just one item, or for a number of items encountered on an ad hoc basis.
So is there an easy way to customize the name of an individual attachment when using Zotfile?
Would it work to go into the Zotfile library (directory hierarchy), change the names manually, and then link to them without using Zotfile's Rename Attachments feature?
(Suggestion for enhancement: add a "Custom Rename" option to the Zotfile menu, which will do two things. (1) It will allow the user to customize the attachment's name. (2) It will flag such a renamed individual attachment to be exempt from future renaming according to the general renaming rules.)
Ma question: dans le fichier d'extraction des annotations, je souhaiterai avoir les références avec la norme APA (Joscha, 2016, p. 23) à la place de (Joscha, 2016:23). Comment faire?
Merci
Still, I do think my suggestion would be a good enhancement to Zotfile. Not only might it be more intuitively obvious than clicking the bold title, it would also keep fools like me from accidentally resetting things with Zotfile's auto-rename.
In general, I'm a big fan of the software tools/Unix philosophy of "do one thing well." But in some cases, it's warranted to have multiple ways of doing one thing. I think this is one of them.
My question: in the extraction file annotations, I would like to have the references with the APA norm (Joscha , 2016, p . 23) instead of (Joscha , 2016: 23). How to do?
Thank's
I have Zotero set to use a base directory, so I can sync with my laptop via Dropbox.
I have Zotfile set so that after adding an article, it will rename it following certain rules, and store it in a sub-folder (within the base folder) named after the year of publication.
I've found two issues:
1- After adding an article (example: http://arxiv.org/abs/1605.05330) it will be properly renamed and stored by Zotfile. If I then remove this article from my library, moving it into the recycling bin first and then *removing it from here too*, the entry in Zotero is gone as it should, but the PDF file will remain orphaned in the sub-folder.
2- The following arXiv entry is not being automatically renamed and moved to its sub-folder by Zotfile and I don't know why: http://arxiv.org/abs/1605.05700. I have to manually force the renaming, and then it gets moved properly.
The first issue is a serious one since it means Zotfile leaves trash behind that one is then unable to trace. The second issue is weird but not a show-stopper (as the first one I believe is)
Cheers.
For this to work the way you want, ZotFile would either have to interface with how Zotero treats file links in a much more heavy handed way or Zotero would have to treat links to files in a completely non-standard way. I don't really see either of those happening any time soon (the former I think is conceptually fine, but a mess to implement; the latter I don't think should happen ever).
Is there any way to track down these orphan files manually? I hate leaving trash behind like this..
As I can only code in Python, I'm not sure I could come up with an entire plugin. Perhaps a simple script that scans the Zotero database and compares with the stored PDFs by Zotfile. How can I access my full database? Which file should I look into?
Also, what do you think of issue 2?
No idea on 2. If you can replicate it reliably, report via github.
Cheers.