SSOAR translator?
SSOAR <http://ssoar.info/> is an open access repository for the social sciences. It already exports BibTeX and EndNote, so importing entries into Zotero is not that difficult. But a native translator would be great, especially for directly downloading the available PDF documents. Would anybody be willing to contribute a translator for the site?
But for the page you link to, e.g. - even if we patch RDF to recognize DC.type content="incollection" as a book section - the data in the BibTeX is much better/clearer than what's in the site header. So using the BibTeX seems worthwhile.
The meta tags also contain the pages and ISBN; maybe these should be moved to another field. I could speak with GESIS about such things. What do you think?
Obviously if we could just get better data in the site header w/o a translator, that'd be ideal, yes. The DC vocabulary, IIRC, isn't great for that - adding Google Highwire meta tags would probably produce the clearest data, but even with DC they could likely be clearer.
As things are now, using the metadata is too much of a guessing game (e.g. both publisher and book title are in DC.source etc.)
I have noticed that the SSOAR import could possibly be improved. When importing data from:
http://www.ssoar.info/ssoar/handle/document/46968
I do not get the DOI and tags (e.g. Thesaurusschlagwörter, Klassifikation, Freie Schlagwörter).
Do you think this can be included in the SSOAR site translator?
<meta name="DC.identifier" content="http://dx.doi.org/10.17645/si.v1i1.109" xml:lang="de" />
Sometimes even when the DOI is provided on the web page of an item it isn't in the meta tag information.
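For what it's worth, here is a rough sketch of how the DOI could be pulled out of a DC.identifier tag like the one quoted above, returning nothing when the tag carries no DOI. It is plain Python rather than actual Zotero translator code; the names `DCIdentifierParser` and `find_doi` are made up for illustration, and the DOI pattern is a loose assumption, not the full DOI syntax.

```python
import re
from html.parser import HTMLParser

# Loose DOI pattern (an assumption for illustration, not the full DOI syntax).
DOI_RE = re.compile(r"10\.\d{4,9}/\S+")


class DCIdentifierParser(HTMLParser):
    """Collects the content of every <meta name="DC.identifier"> tag."""

    def __init__(self):
        super().__init__()
        self.identifiers = []

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <meta ... />.
        if tag != "meta":
            return
        attr = dict(attrs)
        if attr.get("name") == "DC.identifier" and attr.get("content"):
            self.identifiers.append(attr["content"])


def find_doi(html):
    """Return the first DOI found in DC.identifier meta tags, or None."""
    parser = DCIdentifierParser()
    parser.feed(html)
    for value in parser.identifiers:
        match = DOI_RE.search(value)
        if match:
            return match.group(0)
    return None
```

If the page only has a handle URL in DC.identifier, `find_doi` returns `None`, which is the case where the DOI shown on the page would still be missed.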
Let me add, however, that I almost never download the article metadata from this site but follow the link to the article on the publisher's site. The publisher's site almost always contains more recent information while the SSOAR site metadata is missing volume, issue, page information.
However, taking all tags looks like too much and would no longer be useful. The example mixes Thesaurusschlagwörter, Klassifikation, and Freie Schlagwörter in the same field, and they are present in both English and German:
<meta name="DC.subject" content="Allgemeine Soziologie, Makrosoziologie, spezielle Theorien und Schulen, Entwicklung und Geschichte der Soziologie" xml:lang="de" />
<meta name="DC.subject" content="General Sociology, Basic Research, General Concepts and History of Sociology, Sociological Theories" xml:lang="en" />
<meta name="DC.subject" content="Quantifizierung" xml:lang="de" />
<meta name="DC.subject" content="quantification" xml:lang="en" />
<meta name="DC.subject" content="Frankreich" xml:lang="de" />
<meta name="DC.subject" content="France" xml:lang="en" />
<meta name="DC.subject" content="Klassifikation" xml:lang="de" />
<meta name="DC.subject" content="classification" xml:lang="en" />
<meta name="DC.subject" content="Konvention" xml:lang="de" />
<meta name="DC.subject" content="convention" xml:lang="en" />
<meta name="DC.subject" content="Institution" xml:lang="de" />
<meta name="DC.subject" content="institution" xml:lang="en" />
<meta name="DC.subject" content="Neoliberalismus" xml:lang="de" />
<meta name="DC.subject" content="neoliberalism" xml:lang="en" />
<meta name="DC.subject" content="sozialer Prozess" xml:lang="de" />
<meta name="DC.subject" content="social process" xml:lang="en" />
<meta name="DC.subject" content="Institutionstheorie" xml:lang="de" />
<meta name="DC.subject" content="theory of institutions" xml:lang="en" />
<meta name="DC.subject" content="Institutionenökonomie" xml:lang="de" />
<meta name="DC.subject" content="institutional economics" xml:lang="en" />
<meta name="DC.subject" content="Forschungsansatz" xml:lang="de" />
<meta name="DC.subject" content="research approach" xml:lang="en" />
I don't see any (easy) option to just grab a handful of non-repeating tags from these.
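One partial option, sketched below, would be to keep only the tags in a single language using the xml:lang attribute and drop exact repeats. This is plain Python for illustration, not Zotero translator code, and the names `DCSubjectParser` and `subjects_for_language` are hypothetical. It would roughly halve the list, but it cannot separate the classification entry from the keywords, since the markup above does not distinguish them.

```python
from html.parser import HTMLParser


class DCSubjectParser(HTMLParser):
    """Collects (language, content) pairs from <meta name="DC.subject"> tags."""

    def __init__(self):
        super().__init__()
        self.subjects = []

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <meta ... />.
        if tag != "meta":
            return
        attr = dict(attrs)
        if attr.get("name") == "DC.subject" and attr.get("content"):
            self.subjects.append((attr.get("xml:lang"), attr["content"]))


def subjects_for_language(html, lang="en"):
    """Return DC.subject values in one language, in page order, without repeats."""
    parser = DCSubjectParser()
    parser.feed(html)
    seen = set()
    result = []
    for tag_lang, content in parser.subjects:
        key = content.casefold()  # treat "France" and "france" as the same tag
        if tag_lang == lang and key not in seen:
            seen.add(key)
            result.append(content)
    return result
```

Calling `subjects_for_language(page_html, "en")` on the tags above would keep only the English entries, but the long comma-separated classification string would still come through as a single tag.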