alphabetize records without A or The
This is an old discussion that has not been active in a long time. Before commenting here, you should strongly consider starting a new discussion instead. If you think the content of this discussion is still relevant, you can link to it from your new discussion.
So let's leave this aside and return to regular stop words.
All MARC (Machine Readable Cataloging) Records generated by Libraries the world over, include for each title field a code, called and indicator, which tells the program how many spaces to skip in the title field before alphabetical indexing begins. Thus, any title with The as the first word would have an Indicator of 4 so that the computer will skip the first 4 characters, including the empty space before the actual indexing begins.
Your software could be designed to ignore the indicator in those instances when a Style requires the alphabetization to include leading articles (in any language).
If you fail to add an indicator the leading article does not get skipped. People can be trained to do this. I was, 30 years ago.
This should probably happen in a fairly automated way. Possibly with a Pre-set list of stopwords that's editable. The lists probably need to be linked to the language of the entry. Then it probably needs to be possible to turn it on and off in citation styles. None of this is undoable, but it's not an easy problem.
This reinforces the importance of standard 2-character or 2-2-character language abbreviations. This has implications for translators and my hope that eventually 3-character ISO abbreviations from PubMed and full-word language labels from some publishers will soon be automatically converted to ISO 631-1.
citeproc-js
) recognizes an extended attribute on localestyle-options
for this purpose. The attribute, documented here, is active in Zotero, but it is not part of the CSL specification.(Styles that use the attribute should work correctly in Zotero, but because they will not validate, they will issue a warning when installed, and cannot be added to the CSL repository. They may also not work correctly on other platforms.)
***
Yes, it's tied to the locale, so it should adapt to the language set on each item (falling back to the style's bare locale if any, the style default-locale, or to us, in that order).
Is that something that varies across styles and so would need to be configurable in the style specifications?
Zotero has matured greatly over the last decade, and I am thrilled with how well it has worked for me. Thank you so much to the support staff for all your help!
* Add "Original Publisher: Open University" to the Extra field in Zotero so this gets passed as the CSL variable original-publisher
* Add this macro to my style:
<macro name="author-sort">
<choose>
<if match="any" variable="original-publisher">
<text variable="original-publisher"/>
</if>
<else>
<text macro="author"/>
</else>
</choose>
</macro>
* Modify the bibliography sort to use author-sort instead of author
<bibliography>
<sort>
<key macro="author-sort"/>
<key macro="year-date"/>
<key variable="title"/>
...
This doesn't impact the way author is displayed, but whenever original publisher is set, this is used for sorting instead of the actual author. This could be used for any situation where the sort order should not use the author names as they are displayed.
I hope this helps someone else until a more permanent solution is found.
It should be a very good solution for this problem.