Capitalisation and Titles with Multiple Languages

edited 26 days ago
Q: how can I stop Zotero capitalising titles in citations inappropriately?
A: put 'XX' in the Language field of the item in the database.


@dstillman recently brought my attention to the fact that you can use the Language field in the database to affect the way citations are presented. However I found no documentation on the precise way it works or what codes should be used, so I did some experiments. I share my results here, but speak with no authority and welcome additions and corrections.

I am aware of the discussion thread https://forums.zotero.org/discussion/5464/apa-citation-style-capitalization and I thank the contributors for the tips, especially the one about changing the import preferences. However, I find the sources (I use Worldcat a lot) are often wrong anyway. I am posting a new thread because I am not using APA, but MHRA3 (Modern Humanities Research Association standard version 3), which is also supported by Zotero. For the rules see http://www.mhra.org.uk/style. Actually, does the citation standard make any difference to Zotero’s capitalisation rules?


My problem is that I am writing a thesis in English that cites titles with two languages because I have the original title followed by an English translation, for example:
„Demokratie ist Lustig“: Der politische Künstler Joseph Beuys [“Democracy is Fun” Joseph Beuys the Political Artist]
I have carefully used the correct standards for both German and English parts so was annoyed when Zotero rendered the title in the citation as:
„Demokratie Ist Lustig“: Der Politische Künstler Joseph Beuys [“Democracy Is Fun” Joseph Beuys the Political Artist]

Oddly, it capitalised ‘Is’ but not ‘the’ in the English version and incorrectly capitalised ‘Ist’, ‘Lustig’ and ‘Politische’ in the German part. I thought I could do nothing about it until I heard about the Language field’s role in citations.

So what values to use for the language codes? Looking at what has been imported, I see all sorts, like En, eng, EN, for English; Ge, ger or DE for German, FR or fre for French and so on. I also see compound codes like ‘EN-US’. I have not found a list of codes for Zotero (is there one?) but found some standards online. The Library of Congress specifies three letter codes in English like ‘eng’ for English, ‘fre’ for French and ‘ger’ for German (https://www.loc.gov/marc/languages/language_code.html#e). The ISO standard allows up to four variants (ISO 639-1, ~2/T, ~2/B, and ~3) for two- or three-letter codes in English or the language itself: So English can be ‘en’ or ‘eng’; German ‘de’, ‘deu’ or ‘ger’; French ‘fr’, ‘fra’, or ‘fre’ (https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes).

What does Zotero do? After several experiments, I concluded that it seems only to distinguish between ‘English’ and ‘not English’. If the text is English, nearly all words are capitalised, except the first word and a few like ‘the’, of which it presumably has a list. Does the list only include English words? If the text is not English, then it is left as it is – which is what I want even for English titles. By experiment I found that a blank Language field is read as English; so English is the default. ‘en’, ‘eng’, ‘EN’ and the word ‘English’ are also accepted as meaning the title is in English. Capitalisation in the language codes seems to be ignored. ‘de’, ‘ger’, ‘fr’, ‘fre’, ‘fra’ and capitalised equivalents all seem to mean ‘not English’ and leave the capitalisation of the title in the citation alone. I also note that a random code like ‘XXX’ is read as ‘not English’. Am I underselling Zotero here? Actually this is fine by me; I’d rather write the titles myself than have Zotero try and figure out what is ‘right’.

Conclusion: to cite a title without change to the capitalisation put any code in the Language field that is NOT ‘en’, ‘eng’, or ‘English’. If in doubt, ‘XXX’ will do fine.

Happy citing!



P.S. If you are wondering how I typed the correct forms of the quotation marks into Zotero, I didn’t. I copied the imported title to WORD, set the language for each part of the text, re-typed the quotation marks, then copied and pasted it all back into Zotero.
  • The wall of text is a bit dense to read. What exactly is the problem you’re trying to solve, briefly?
  • edited 26 days ago
    See paragraph 3, the one starting "my problem is"...
  • ... and I did solve it; see the last paragraph starting "Conclusion..."
  • @bwiernik In response to your comment on the original text, I added the first two lines to the text, which put the question and the answer in a nutshell. Hope that helps!
  • @dstillman recently brought my attention to the fact that you can use the Language field in the database to affect the way citations are presented. However I found no documentation on the precise way it works or what codes should be used
    I linked to documentation in the same sentence where I said this was possible.
  • @AndySymons That is not a good practice. You should enter an actual language into the field or leave it blank. If there is an English language title with words that should never be capitalized, wrap those in <span class="nocase"> </span> tags.
  • edited 26 days ago
    ISO 639-3 has a code for multible languages 'mul', it also has a code for no-linguistic content 'zxx' and it has a code for undetermined 'und'... any of those would be better than XX or xxx; Because they would be accurate and would be valid BCP47 codes. https://tools.ietf.org/html/bcp47

    @AndySymons BCP47 uses all three ISO 639 standards and ties them together for IT uses. It is how we get the Locale IDs. -2 and -3 were synchronized some years ago, but -3 has more than -2 did so it became the default code standard. -1 was already in use widely by the IT industry, so BCP47 says something like:: if a language has a -1 use it, if a language doesn't use -3, but the same language might be written in different ways in different places, so when needed for specificity, add a country code. BCP47 spells this all out and where to get what codes.

    My understanding was that the Zotero Language field was for the language of content of the resource, the linked documentation seems to indicate that the language field is used as a field for linking to a CSL locale. Locale Identifiers are BCP47 compliant (with few exceptions, but new ones are to be BCP47 compliant), so is the correct answer on what goes in this field "BCP47 tags" for how the citation content will be styled? (or more narrowly: 'Choose a locale').
  • @dstillman Yes you did point to the documentation, which mentions some codes that can be used. I agree with most of what is said there, except that titles should 'usually be in sentence case'. With a mixed-language title, one has to stop all conversion, so all parts should be in the capitalisation appropriate to the language.
  • edited 26 days ago
    @bwiernik Sorry, the type of tagging you suggest is too cryptic for me; not easy enough to remember.
  • edited 26 days ago
    @hughp3 Thanks for telling me about BDP47. Yours is an excellent suggestion and I may well change my 'xx' codes to 'zxx', although this is still not strictly compliant with the BCP47 recommendations. What I really mean by 'xx' is 'this citation is in English but I want to fool Zotero into not changing the capitalisation'. Where the document is in German, say, I already use the 'de' code; it then does not change the capitalisation of any part of the title so the German and English parts have to be correctly capitalised in the title field.
  • One reason I want to stop Zotero changing my capitalisation is that it changes both the Title and the Series fields (possibly other fields too?). I often (mis-)use the Series field for additional text that I want in the citation, like "catalogue of an exhibition at the Staatliche Kunsthalle Baden-Baden, 6 December 1986 to 15 February 1987" and I don't want this all in capitals.
  • Using the multilingual language tag like that will prevent proper casing of English titles.

    I suggest you store that sort of information in Extra like this:
    Medium: catalogue...

    That will be picked up and formatted correctly by citation styles.

    I also recommend wrapping non-English text in the span tags I mentioned.
  • edited 26 days ago
    @bwiernik 'Using the multilingual language tag like that will prevent proper casing of English titles.' --> that's exactly what I want!

    'I suggest you store that sort of information in Extra like this...' I tried that, but the Extra field is not used in MHRA3 citations.
  • ... to be clear, I want the final citation to look like this:

    Hofer, Sigrid, ed., Entfesselte Form: fünfzig Jahre Frankfurter Quadriga [Unshackled Form: Fifty Years of the Frankfurt Quadriga Group], catalogue of an exhibition at the Städelsches Kunstinstitut and the Städtische Galerie Frankfurt am Main, 1 October 2002 to 5 January 2003 (Basel; Frankfurt am Main: Stroemfeld, 2002)

    ... with the title italicised; I don't know how to reproduce that here.
  • edited 18 days ago
    Is Hofer, Sigrid the editor of the catalogue or are they the collection curator? It is unclear to me if you are citing a book (the catalogue describing the collection) [by intention or because citing a collection in Zotero is hard and generally the collection catalogue is the only remnant of a collection after the exhibit is over] or if you are citing the collection itself... Citing an exhibit itself might be better done as a presentation. Have you looked at the fields for a presentation in your specific stylesheet?

    if Hofer, Sigrid is the curator and the catalogue editor, and you go the presentation type route then adding curator as a role in CSL 1.0.1 might be a good thing. @AndySymons you could add a comment to the pinned CSL item as they are asking for feedback a the moment.
  • I am citing the BOOK, which is itself the catalogue of an exhibition. That is precisely why I want the additional text "catalogue of an exhibition...". Apart from the additional text, the details are as for any other book. Sigrid Hofer is in this case the editor of the book, so correctly cited. She was quite likely the exhibition curator too, but that is irrelevant here.

    I do not see any need or way to cite the 'collection itself'; I can mention it in the text of course, but a citation is to something concrete that the reader can go and find for him/herself. I sometimes refer to a web page with installation shots or other details of a past exhibition, when available, but that is not the subject of the question in this thread.

    I don't use 'presentation' at all and its fields are not useful for an exhibition or art collection. I'm not sure what it is meant for. For a presentation at a seminar, I use the item type 'conference paper'. I'm not sure what other kind of presentation one might want to cite?
  • “Presentation” is for presentations at conferences and similar things. “Conference paper” is for papers formally published in a conference proceedings book or journal.
  • edited 18 days ago
    Conference Paper "A paper presented at a conference and subsequently published in a formal conference proceedings publication (e.g., as a book, report, or issue of a journal). For conference papers that have **not** been published in a proceedings, use `Presentation`." The zotero documentation is helpful in these sorts of cases: https://www.zotero.org/support/kb/item_types_and_fields — though I'm not alawys the first to know what is or isn't in the documentation...
  • Thanks to both @bwiernik and @hughp for those clarifications.
Sign In or Register to comment.