Deleting duplicate PDFs

Hi all,

I've been trying Zotero standalone recently, and so far it's been great. One nagging issue for me is that I've got a large collection of PDFs including a significant number of identical copies of certain documents (this is my fault, not Zotero's). These documents have identical content at the binary level, return the same hash, etc. For better or worse, I now have many of these duplicates in my Zotero library. With that background, two questions:

1) Is there a way to remove/merge/etc. binary-identical files from within Zotero?

2) If not, is there a more hackish way to remove all but one of the identical files from Zotero's storage directory? I've written simple scripts in Python to identify duplicates by hashing files under that path, but I don't want to cause Zotero to freak out by deleting files from disk that it still expects to find.
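
For reference, here is a minimal sketch of the kind of hashing script mentioned in (2). It is illustrative only: the function names `file_sha256` and `find_duplicate_pdfs` are hypothetical, the storage path is an assumed location that may differ on your system, and the choice of SHA-256 is an assumption. It only reports groups of identical files and deletes nothing.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_sha256(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicate_pdfs(root):
    """Group *.pdf files under root by content hash.

    Returns a dict mapping digest -> list of paths, keeping only
    digests shared by more than one file.
    """
    by_hash = defaultdict(list)
    for path in sorted(Path(root).rglob("*.pdf")):
        if path.is_file():
            by_hash[file_sha256(path)].append(path)
    return {h: paths for h, paths in by_hash.items() if len(paths) > 1}


if __name__ == "__main__":
    # Assumed location; substitute the actual Zotero storage directory.
    storage_dir = Path.home() / "Zotero" / "storage"
    if storage_dir.is_dir():
        for digest, paths in find_duplicate_pdfs(storage_dir).items():
            print(digest)
            for p in paths:
                print("    ", p)
```

Note that the `*.pdf` pattern is case-sensitive on most non-Windows systems, so files ending in `.PDF` would need a second pattern.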

Any help would be greatly appreciated!