Getting Journal Abbreviations from a repository
http://forums.zotero.org/discussion/5370/wiley-and-science-direct-journal-ab/#Item_3
Strangely enough, was not widely discussed yet. EndNote offers this feature, the open source Jabref also. There exist several publicly available repositories.
Would be nice to have the functionality to automatically insert/replace abbreviations in a selected collection in Zotero.
Strangely enough, was not widely discussed yet. EndNote offers this feature, the open source Jabref also. There exist several publicly available repositories.
Would be nice to have the functionality to automatically insert/replace abbreviations in a selected collection in Zotero.
- http://forums.zotero.org/discussion/3877/automatic-generation-of-publication-abbreviated-names/
- http://forums.zotero.org/discussion/8278/text-substitution/
- http://forums.zotero.org/discussion/6954/bibtex-export-doesnt-contain-journal-abbreviations/
I doubt Endnote or JabRef covers the issues discussed in these threads. To my great regret, this issue is more complex and ugly than I would have thought.Actually - which issue isn't covered by the jabref/Endnote approach?
I imagine (intuitively) a pretty simple routine: in the style instead of just
format="short" - you write format="short-isi" - and then the style automatically replaces journal names with abbreviations in the isi list, leaving the ones it doesn't have an abbreviation for with their full name (which is what most styles request anyway).
I'm a social scientist and we don't do journal abbreviations much, but for medicine and other natural sciences this seems super standard and important. I've edited a whole bunch of biology and medicine styles over the last weeks, and use journal abbreviations from "such and such list" has always figured prominently in the style guide.
In that case, I'd probably imagine using a URI to identify the abbreviation list.
Right now the periodicals db seems very far from providing a solution.
When you raised the issue in August people really seemed at a bit of a loss of how to deal with it:
http://groups.google.com/group/dataincubator/browse_thread/thread/1c89d436e07fe6b7
There is a limited number of such lists that are actually being used. Those could also just be included in Zotero and things would just work (that's what endnote does, and, dare I say it, I think does well). For people who do require "obscure list" they could be imported separately (once again, as endnote does).
I don't think a status quo that requires pretty much every "hard" scientist to manually edit 80% of his/her bibliography should exist for any longer than absolutely necessary.
That's not to say that I am afflicted with a chronic belief that this is the right way for things to work, but it's the way things are set up in the test-bed processor at the moment.
PS: Everything in the first paragraph above, from "As far as I know ..." onward is mistaken, as Rintze points out. See my next post below for a corrected outline of processor behavior.
Since, contrary to what I wrote, there is a field for journal abbreviations, and since form="short" does have meaning in current Zotero, the supplied abbreviation list should only be used when form="short" is set, and the fallback should be to journalAbbreviation, then to the full title. Sorry for my earlier confusion.
This only solves the mechanical problem of applying abbreviations from an external list. It leaves open the question of how to manage and select the lists themselves (whether by an abbreviation-list attribute or by some mechanism external to CSL markup).
http://forums.zotero.org/discussion/8278/text-substitution/?Focus=37489#Comment_37489
Dan or Simon: any opinions on this?
Basic issue is that if you go with the approach Frank used here (a simple substitution list untied to CSL), you'd still need some way for a user to load or specify those lists. The user would be responsible for all of this.
OTOH, using a URI would allow lists to be defined by style, and so for it to be essentially transparent to users.
Two options that come to mind, both starting with the CSL file having some sort of processing instruction or embedded attribute/element that includes a URI:
1) The UA reads the URI, fetches the data, and provides it to the processor using the mechanism Frank already has in place.
2) The UA reads the URI, fetches the data, and preprocesses the CSL file, substituting in the data as XML data. This might make for easier testing and would allow styles to hard-code lists if they really wanted to, though there might not be much of an advantage to that. But I think Ant does something like this.
3) Abbreviation lists are stored in a separate folder in the same directory as the styles folder. If bandwidth isn't an issue, standard abbreviation lists can be shipped along with the CSL-application (e.g. Zotero). New lists can be installed by hand by placing them in the abbreviation list folder. If a new style is installed that uses a new abbreviation list, that list is downloaded as well. The CSL-application takes care of automatic updating of the abbreviation lists (just like styles). In this scheme it would be enough if the CSL processor has access to the local abbreviation list folder.
but from a style-coding and using perspective that's exactly what I expect.
and if is possible that will also be in line with Bruce's concern that the user shouldn't have to import any lists.
When a user installs a new style that requires an abbreviation list, the installation routine would automatically check if that list is already installed in zotero (i.e. the lit file present in the directory) and otherwise download the list together with the style.
Maybe I'm naive about the easiness of doing that from a programming point of view, but I think this would be ideal from a user point of view. I imagine the current install routine checks where the style folder is in zotero, checks if the style already exists there (it distinguishes between install and update) and the installs the style. Writing an additional loop that checks if and what abbreviation list is required for the style wouldn't seem that much harder.
<abbreviation-list>
<info>
<title>BIOSIS</title>
<id>http://www.zotero.org/abbreviation-lists/biosis</id>
<link href="http://www.zotero.org/abbreviation-lists/biosis"/>
</info>
<journal>
<full-title>Journal A</full-title>
<abbreviation>J. A</abbreviation>
</journal>
<journal>
<full-title>Journal B</full-title>
<abbreviation>J. B</abbreviation>
</journal>
...
</abbreviation-list>
Storing the abbreviation lists in JSON might perhaps make more sense, as many lists contain close to 10,000 journals (http://www.library.illinois.edu/biotech/j-abbrev.html).
1) a list of 10,000 items is insane. Why not just specify abbreviation algorithms as described in that illinois link ("List of Title Word Abbreviations")?
2) I don't see any great advantage using JSON here; the XML might be as simple as:
<substitutions>
<substitute match="Test Journal" replace="TJ"/>
</substitutions>
Test Journal = TJ
. I see few advantages of inventing our own XML unless we have specific ideas for taking advantage of extensibility.Since the list of actual titles for this case is already out there, and since other styles do have abbreviation requirements that cannot be covered by simple word lists alone, it seems like providing for word-substitution in this case, but for title-substitution in others would add unnecessary complexity.
PS: Uh, also what noksagt said.
So for sake of argument, why should this be a matter for the CSL processor at all, except for the fact that it might be nice (though not essential) to be able to specify the list in the style? Seems that clients like Zotero could do the substitutions before sending over the data to process.
Being able to specify a URI to a list of abbreviations to use & having a common file format for such a list seems very simple to put in the specification. Assembling those lists and, to a lesser extent, processing those lists seems harder. If Frank thinks that he can do the latter, I don't see any downsides to putting it in.
If we don't put it in the spec, how are the clients going to decide which abbreviation list to use with each style?
I also don't see why a CSL processor should be responsible for this. JabRef, for example, does the substitution; not BibTeX? Leave it to the user to determine (which admittedly, puts the burden on the client app to support their making this choice).
It might be possible to add a simple attribute for this without adding significant risk or added work; will think on it.
I think the reason this should be in the csl is that the direction which abbreviation to use are in the style guide - so when translating the style guide to a csl that information should be included.
AACN Clinical Issues AACN Clin. Issues AACN Clin Issues
AADE Editors Journal AADE Ed. J. AADE Ed J
See http://www.library.uq.edu.au/endnote/journal_terms.html Agreed. Nobody asked for this to be included in 1.0.
If we consider abbreviation part of CSL (and I'm not convinced we do), then the format should probably be XML, given that CSL is an XML format.
If we consider it part of the client app, then I don't care.