Duplicates across collections imported from multiple databases (meta-analysis)
I'm starting work on a meta-analysis and have chosen several relevant databases (DBs). For the final batch, I expect to be left with around 15-20 studies to be meta-analysed.
Guidelines such as PRISMA recommend that authors report the number of initial results in each DB (unclear if before/after filters have been applied), and how many of those overlap between DBs (number of duplicates). Thus, in this case the aim is not to merge duplicates but to count them and count also the unique items.
So, on to my questions:
1) Once results from each DB are imported into a collection, how can I arrive at the number of duplicates that exist between all of them, AND
2) .. how can I create a new collection formed of unique items from across all collections?
3) Some of my DBs (e.g. Google Scholar, ExLibrisPrimo) don't have an option to export *all* search results to Zotero (or even as an XML/XLS file). The 'Zotero Item Selector' button in Chrome allows me to only save as many items as the DB allows to fit on one results page, so only 10-or-so items at a time can be imported. Is there a better way?
I'd guess no one feature of Zotero specifically exists for these aims, and that a creative workaround would have to be found. Many thanks in advance to this great community.
Guidelines such as PRISMA recommend that authors report the number of initial results in each DB (unclear if before/after filters have been applied), and how many of those overlap between DBs (number of duplicates). Thus, in this case the aim is not to merge duplicates but to count them and count also the unique items.
So, on to my questions:
1) Once results from each DB are imported into a collection, how can I arrive at the number of duplicates that exist between all of them, AND
2) .. how can I create a new collection formed of unique items from across all collections?
3) Some of my DBs (e.g. Google Scholar, ExLibrisPrimo) don't have an option to export *all* search results to Zotero (or even as an XML/XLS file). The 'Zotero Item Selector' button in Chrome allows me to only save as many items as the DB allows to fit on one results page, so only 10-or-so items at a time can be imported. Is there a better way?
I'd guess no one feature of Zotero specifically exists for these aims, and that a creative workaround would have to be found. Many thanks in advance to this great community.
http://zotero.org/support/collections_and_tags#special_collections
Regarding importing whole search results, for Google Scholar, first save the items to your "My Library" by clicking the star icon under references, then export in one go using the Export button on the My Library page. You can also search using the Publish or Perish program and export from there. There are similar systems for other databases.
Also, after I've merged all duplicates, it seems to me each collection will retain its individual (unique) items, but how can I merge them all into a new collection, since it will no longer be of interest which DB they originated from?
It seems to me Zotero's algorithms are quite good (certainly better than what I'd achieve by hand - and faster to boot), but there is still no Merge All option.
And what about merging the individual collections after duplicates across all have been removed?
Regarding Merge All, there is no such function, but I personally suggest not using such a system to avoid inadvertently merging false positives. I’ve done a dozen meta-analyses with thousands of hits with Zotero. Merging all of the duplicates with Zotero takes 15-20 minutes.
I did not see any New Library option (only New Collection) so I assumed I'll have to work within MyLibrary. I'll be doing this alone so not sure if the Group Library is the most appropriate way to define a new library?..
1) Once results from each DB are imported into a collection, how can I arrive at the number of duplicates that exist between all of them, AND
I created individual collections in the library for each database under Dissertation (I did not use the Groups as advocated because I did not see that tip earlier. I will do on my next SR!)).
I then tagged all the files as "dbs:Scholar", "dbs:PUBMED" as needed in each collection. (select one article in the collection and add a new tag, then start typing on the tag box on the bottom left so that the new tag shows (thats why I used dps, so that all relevant tags will show), then press control+A to select all the articles, then drag and drop them on the tag).
Only then I went and merged all the duplicates. The result is that the duplicates will have both tags, so when I go into the Scholar group, I select the dbs:PUBMED tag and see all the articles that are present in both scollar and pubmed.
Problem is some duplicates are not recognised properly, so make sure you go on the main library and check them manually too.
2) .. how can I create a new collection formed of unique items from across all collections?
Now that they are all merged, you just create a new collection and copy paste from the database collections. The duplicates are considered one item and wont appear twice.
3) Some of my DBs (e.g. Google Scholar, ExLibrisPrimo) don't have an option to export *all* search results to Zotero (or even as an XML/XLS file). The 'Zotero Item Selector' button in Chrome allows me to only save as many items as the DB allows to fit on one results page, so only 10-or-so items at a time can be imported. Is there a better way?
For google scholar use https://harzing.com/resources/publish-or-perish its amazing, it gives you all google scholar results. Otherwise you cannot import too many because google stops you (too many connections or something)
For other ones, you could ask your university library if they offer this service.
Hope it helps,
Costas