Duplicates
Hi,
I test this new feature and have a comment.
When different titles are detected, when I click on the little icon, I could see the first 5-6 words. Which are identique, so I don't know what could be the difference. I suppose it's the presence/absence of the final dot. Could it be possible to see the full title? Maybe with a tooltip.
Other question: could it be possible to easily see the collection provenance? If I understand well, merging is only about the "content". I would like to have the ability to just delete one copy from certain folders, directly from the Duplicates folder.
Hope it's clear,
Thanks
Mel
I test this new feature and have a comment.
When different titles are detected, when I click on the little icon, I could see the first 5-6 words. Which are identique, so I don't know what could be the difference. I suppose it's the presence/absence of the final dot. Could it be possible to see the full title? Maybe with a tooltip.
Other question: could it be possible to easily see the collection provenance? If I understand well, merging is only about the "content". I would like to have the ability to just delete one copy from certain folders, directly from the Duplicates folder.
Hope it's clear,
Thanks
Mel
Is there a way to provide some options for the duplication determination such as using title, year, author for example?
Thanks.
I'd be reluctant about options - this is a feature that should just work and shouldn't require a separate tab in some preferences where users have to tinker. Too many options make software less user friendly.
I'm having a hard time figuring out how start the duplicate detection process. I'm trying to figure out what reference manager software to use right now, and although zotero seems just great, duplicate detection is a must for me. I've upgraded to the 3.0 version and have gone through all of the menu buttons but can't seem to figure out how to do it.
Any help would be much appreciated.
Thank you!
Allison
Restart Firefox, go to the duplicate folder and then see if there is an error report that you can submit (and if so post the error ID here).
http://www.zotero.org/support/reporting_bugs
Duplicate detection works for regular Zotero items.
Zotero does detect metadata for PDFs, but it doesn't do it automatically like Mendeley - you need to select them - can be multiple at once - and right-click --> Retrieve Metadata for PDF.
Once you've done that (and if it works - the rate of success varies), the pdf will be attached to a Zotero item (I think I remember that's the same in Mendeley). Those will be recognized as duplicates.
The Duplicate search should ignore/exclude items which are related. The relation between two (or more) items has to be entered manually, and then clearly, the user does not consider these to be duplicates.
Thank you for the great tool!
Kind regards,
Suppose I have related items Ax and Ay, and related items Bx and By. Suppose also that all items in both sets would ordinary turn up as a single duplicates group. In that case, how should the duplicate relation be shown in the UI? And if the user requests a merge how should that be done?
We do have one instance where related items are generated manually. The Google Scholar translator will set relations between alternative cites to the same legal case. If you download the case twice, you have the situation described above.
In the Google Scholar parallel cites situation, it would be sufficient for the algorithm to be fussier about matching, so that congruent elements in each set are properly paired in the merge display.
The example described by you is complex ({Ax, Ay}, {Bx, By}). Parentheses () stand for a duplicate group and curly brackets {} for related items. And as such a group, it would be displayed in the list as a duplicate group.
[basic assumption:] When an item is displayed in the duplicate list, it is known to which other items it is a duplicate to, right? Now, when each of the items in the duplicate group (Ax, Ay, Bx, By) are related to each of the items in the same group, this group will be ignored from the list.
That is false for your example of a complex duplicate group ({Ax, Ay}, {Bx, By}): The duplicate item Ay, is not related to Bx and By. But it is true for a simple case, when no Bx and By exist. I can name a couple of such duplicate-related items:
- a book and a chapter from that book (the book title and a chapter name are similar) [related]
- a paper in a journal and a conference contribution (the title and the authors are the same) [related]
- a paper in a journal and a webpage of the scientific topic (same title, same author) [related]
- a book in two different editions (title, authors are the same, ISBN, edition, abstract are different) [in a relation]
- a document in two different language versions (same authors, different titles and language) [related]
- two books with the same titles, different authors [related]
Or, there is another question: Am I misusing the relation feature?
I haven't experienced the Google Scholar automated relation setting yet. But even it is non-manually set, it fits to the above described scenario (simple pairs are ignored, complex groups are displayed).
p.s. I love "fuzzy" :) it has been always a great help when it occurred in my way...
I'm easily confused about these things, but I think I follow your logic, and I think it makes sense.
Dan, any timeline on implementing the "usual method"? Right now, I need to go back to the full library display to find out which collections will be changed for each duplicate.
The duplicates feature is great, BUT - I have 5 papers from a single conference (obviously the same conference name for each of the 5). Each has a different author(s) and a different paper title, but Zotero treats them as if they were duplicates. They need to be 5 separate references and it is only by looking at the list I realised what was happening. Is there a way of marking them 'not duplicates' so this doesn't happen the next time I look for duplicates? thanks
"Otago cavalcade, 1911-1915"
"Otago cavalcade, 1901-1905"
These are definitely different books (the titles show that), but they show as duplicates.
Any suggestions for how I could make them definitely not duplicates?
http://forums.zotero.org/discussion/22395/duplicates/#Item_6