Abstract not scraped from SpringerLink
http://www.springerlink.com/content/u927215068044412/
The abstract was not scraped from the above article.
The abstract was not scraped from the above article.
This is an old discussion that has not been active in a long time. Instead of commenting here, you should start a new discussion. If you think the content of this discussion is still relevant, you can link to it from your new discussion.
With a freshly-updated copy of Zotero, there are a few problems with the springerlink scraper for the above article:
* The authors' first names are not scraped: I get "Nandivada, (first)" in Zotero (along with the other two authors), and "Nandivada and Pereira and Palsberg" in BibTeX export, which leads me to suspect the names aren't being scraped
* The scraped item is a Web Page in Zotero, and a @misc in BibTeX, when it should be a conference paper -- the RIS "TY" field says Chapter, and it's being ignored
* The URL in the RIS file is a DOI entry (recognizable because it has dx.doi.org in it), and should be filed as such.
* (Possibly unrelated) The BibTeX export of the scraped entry doesn't include all the fields stored by Zotero -- the URL field, for instance, should be maintained since some citation formats include the URL in the citation.
I don't know much about Zotero's APIs, but I could try writing a new version of the scraper to fix these...
http://forums.zotero.org/discussion/969#Item_10
DOI has a ticket open at:
https://www.zotero.org/trac/ticket/684
(but DOI support for other types might need to be added)
To fix authors not appearing correctly, I changed the body of the following loop in the SpringerLink scraper:
// fix incorrect authors
var oldCreators = item.creators;
item.creators = new Array();
for each(var creator in oldCreators) {
item.creators.push(Zotero.Utilities.cleanAuthor(creator.firstName + " " + creator.lastName, creator.creatorType));
}
I'm not quite sure why the incorrect authors fixes are needed at all -- if I delete this loop altogether, it works fine for me... In any case, changing "author" to creator.creatorType is probably right; including the creator.firstName is what was snarling the import, since it's correct in the RIS data. Are there counterexamples where this change now breaks formerly working pages?
To fix the bookChapter problem, I changed the condition in the following if-test in the RIS scraper:
// first check typeMap
for(var i in typeMap) {
Zotero.debug(i);
if(value.substring(0,typeMap[i].length) == typeMap[i]) {
item.itemType = i;
}
}
The problem is typeMap["bookChapter"] == "CHAP", and the RIS contains "CHAPTER"; this test allows for partial matches.
hope this helps...
However, if bierner's fixes work, then please do incorporate a work-around until Springer does something. They are a major resource, and isn't the strength of Zotero that these scrapers can be adapted quickly?