Citations not collapsing
Hello,
I made some simple mods to the example in http://citationstyles.org/downloads/primer.html to add:
1. et al.
2. disambiguate-add-year-suffix
3. collapse
While the et al. and disambiguate are working fine, the citations are NOT collapsing. For example, I'm getting this:
(Colby et al. 2008a; Colby et al. 2008b; Colby et al. 2010)
instead of what I want:
(Colby et al. 2008a, 2008b; Colby et al. 2010)
Here's the gist with my edited file: http://gist.github.com/812042
This happens in both the test pane and in Word. I am using Zotero 2.1b5 with MacWord Integration 3.1b2. Is this a bug in the processor, or am I doing something wrong? Thanks!
John
I made some simple mods to the example in http://citationstyles.org/downloads/primer.html to add:
1. et al.
2. disambiguate-add-year-suffix
3. collapse
While the et al. and disambiguate are working fine, the citations are NOT collapsing. For example, I'm getting this:
(Colby et al. 2008a; Colby et al. 2008b; Colby et al. 2010)
instead of what I want:
(Colby et al. 2008a, 2008b; Colby et al. 2010)
Here's the gist with my edited file: http://gist.github.com/812042
This happens in both the test pane and in Word. I am using Zotero 2.1b5 with MacWord Integration 3.1b2. Is this a bug in the processor, or am I doing something wrong? Thanks!
John
And why wouldn't you expect (Colby et al. 2008a, 2008b, 2010)?
My mistake, (Colby et al. 2008a, 2008b, 2010) is what I expect. Thanks!
Could you try what happens when the hidden authors are identical?
It would be easy enough to adjust the processor to permit collapsing in this case. Let me touch the CSL list first, and see what other folks think.
(Edit: In passing, I'll note that the posted CSL code handles spacing badly. The year and author should be joined with a group element and a delimiter, rather than using a space prefix. Some of us had a hot discussion a little over month ago on the topic of whether the processor should rescue styles by suppressing spaces. I was on the should-rescue side of that one, and the citeproc-js processor does work very hard indeed to prevent duplicate spaces in output. I was surprised to see that this style code actually succeeds in sneaking past my handiwork, and produces a duplicate space when the author is dropped out of the citation. As the debate about space suppression was about saving extra work by rescuing existing styles, I'll draw a line here and say that the style should be fixed, not the processor. :)
Explanation (which we can add to the CSL spec if needed):
Year suffix indices are intended to disambiguate multiple items from the same author in the same year for author-date styles.
... where "author" is understood as a representation of the complete author list independent of any subsequent et al. shortening.
E.g. the processor should be seeing something like "doe-john:smith-jane:jones-david" internally for all sorting and disambiguation operations.
I'm not really understanding how it would realistically be any different.
But that's current behavior in both csl 0.8 and 1.0 and required by many styles (where names disambiguation is turned off).
So for disambiguation purposes, the processor already does ignore everything that's hidden by et al - and I think it makes sense if it did the same for collapse/sorting.
I would guess that the majority of styles would prefer if not require this practice.
But I don't really do CSL theory, so I might be off-base here.
The theory as I've loosely defined it is that a citation reference is an abbreviated pointer to a fuller description in the bibliography. So the suffix attaches to the bibliographic referernce, and is merely picked up by the citation reference.
Given publication practices in the natural and life sciences, I think year suffix disambiguation for these cases is crucial - it's not uncommon to have a paper with 20 authors, then some new postdoc joins the lab or contributed an image or whatever and the next paper has the same 10 authors, but then some others - so how else would you disambiguate these citations in the text? By adding 11 authors?
(I see ajlyon just made the same point).
And I don't see any issue in sorting - the author names can still determine which of the two articles is "a" and which "b" - it does that currently and it's never been a problem.
I stand by my argument that, at least in the majority of the social sciences, "Jane Doe and John Smith" are treated as different authors than "Jane Doe and John Jones". This means that they get separately listed and indexed in the bibliography:
Doe, J and J. Smith (2002a) ...
——— (2002b) ...
Doe, J and J. Jones (2002a) ...
——— (2002b) ...
If a processor did not do that, I'd call it a bug.
Now if that doesn't work for some styles (e.g. in the sciences), then I'd suggest we add a parameter to toggle behavior (and update the test suite to deal with this).
I see only two options - year suffix (i.e. Colby et al. 2008a and Colby et al 2008b) or added names (Colby, Smith, et al. 2008; Colby, Meyer, et al. 2008). Added names are already possible in csl and overrule year suffix, so if that's your concern it's already working.
But surely you can't be advocating ambiguous pointers - having two Colby et al 2008 in the text and having the reader guess?
But I work in the social sciences, where long authors lists are exceedingly uncommon. So I've not personally come across this issue.
Those also wouldn't collapse - which brings us back to the original question - thanks to Rintze for asking APA - I think the worst case here is indeed that styles disagree, and if experience is any indication they will. Maybe John Colby is following a specific style guide or has an example?
http://www.elsevier.com/locate/inca/522773/authorinstructions
They want et-al-min=1, disambiguation with year-suffix, and collapse=year. The disambiguation/collapse should happen even if the hidden authors differ (so long as the first authors are the same). The is pretty common in Elsevier journals. Thanks, John
Thanks for pressing on this.
I think for styles that don't use given name disambiguation, the collapse rule as described by John Colby (and implemented in csl. 0.81 and now 1.0) makes most sense - and APA's response just highlights what Bruce has been saying all along - that this isn't even relevant in most social science styles, because they would disambiguate differently.
Sure thing. Here is one from Psychiatry Research, and another from NeuroImage.
(look for the multiple "Chang", "Moeller", and "Volkow" refs)
Alicata, D., Chang, L., Cloak, C., Abe, K., & Ernst, T. (2009). Higher diffusion in striatum and lower fractional anisotropy in white matter of methamphetamine users. Psychiatry research, 174(1), 1-8. doi:10.1016/j.pscychresns.2009.03.011
(look for the multiple "Hasan" refs)
Lebel, C., Caverhill-Godkewitsch, S., & Beaulieu, C. (2010). Age-related regional variations of the corpus callosum identified by diffusion tensor tractography. NeuroImage. doi:10.1016/j.neuroimage.2010.03.072
Here are the files too: http://dl.dropbox.com/u/4303627/outbox/cite_collapse_examples.zip
Thanks for all the continued support,
John