Zotero ignores locale settings
Ok, I spent the better part of the last two days trying to figure out how to force Zotero to use my locale settings with Chicago full note & bibliography.
My firefox' locale is Polish, Zotero uses Polish wherever possible. Yet in footnotes and bibliographies I still see some remnants of English. For instace, in the edition field of Zotero I put simply the number 3. Yet in footnote the work looks as follows:
Mękarski, Stefan. "Lwów: karta z dziejów Polski". 3rd wyd. Londyn: Koło Lwowian, 1982.
"3rd wyd" stands for "3rd edition", but the rd is clearly some English ordinal ending. Needless to say, I don't need it in my Polish language work.
The problem is also visible in the case of dates and date formatting. I was able to manually modify the files to force Zotero to use D-M-Y format, but not the correct month names used by the Polish language (those in csl file are fine, but Zotero apparently doesn't use them). (See here for details: http://forums.zotero.org/discussion/6797/zotero-appearance-how-do-i-change-date-format-in-date-added/#Item_18)
My firefox' locale is Polish, Zotero uses Polish wherever possible. Yet in footnotes and bibliographies I still see some remnants of English. For instace, in the edition field of Zotero I put simply the number 3. Yet in footnote the work looks as follows:
Mękarski, Stefan. "Lwów: karta z dziejów Polski". 3rd wyd. Londyn: Koło Lwowian, 1982.
"3rd wyd" stands for "3rd edition", but the rd is clearly some English ordinal ending. Needless to say, I don't need it in my Polish language work.
The problem is also visible in the case of dates and date formatting. I was able to manually modify the files to force Zotero to use D-M-Y format, but not the correct month names used by the Polish language (those in csl file are fine, but Zotero apparently doesn't use them). (See here for details: http://forums.zotero.org/discussion/6797/zotero-appearance-how-do-i-change-date-format-in-date-added/#Item_18)
Below is my working version of locales-pl-PL.xml. It's by no means complete as I wasn't sure about all of the entries and in some cases the xml language is too simplistic to work fine with Polish grammar. Part of the problem is that we have both grammatical gender (masc./fem./neutr.) AND declension in Polish. Which means that for instance "presented at" term could be:
zaprezentowano
But this is but a workaround, as it should be zaprezentowana (feminine) if we're speaking of, say, work or paper, which are feminine in Polish, but zaprezentowany (in case of, say, lecture, which is masculine in Polish). To add to the confusion, all forms have also their plural forms.
The same problem is with month names. I believe I mentioned that in the other thread, in short: in Polish, when giving the full date, you use accusative case and not nominative. However, as ajlyon pointed out in that thread, nominative might also be needed, in the case when there is no full date and instead only month and year are known (not that this was the case with many Polish language publications, as books are recorded by year mostly here).
Also the ordinal numbers ("first" is "pierwszy" when masculine, but "pierwsza" when feminine, "pierwsze" if plural and so on). That's why I chose a workaround as it's customary to add a dot after a numeral (say, 1., 2., 3. and so on) to mark it as an ordinal number. Hence 1. stands for both pierwszy, pierwsza and pierwsze.
Finally, some clauses are almost always followed by certain punctuation marks in Polish academic references. For instance "w" ("in") is always followed by a colon ("How to get there" w: "To the Moon!", 1998). Not sure if such a colon is still in accordance with some styles (Chicago for instance) as they might be well-suited to much less grammar-rich languages.
Also, I wasn't sure what the "editor" fields stand for. If it's a journal editor (as in: editor in chief) or an editor of an academic series, then it should be "redaktor" (plural: "redaktorzy"; short form: "red.". However, if it's the person to review an academic publication (say, a phd thesis), then it should be "recenzent" (pl:"recenzenci", short: "rec.").
All in all, the abovementioned changes allowed me to create some sort of a Polish localisation of Chicago rather than a direct translation. I marked all the lines I didn't translate for the reasons stated above with hashes.
Anyway, here's the file:
https://gist.github.com/1064418
And the bottom line is that, apart from isolated cases, most professors here in Poland set one basic rule of thumb: in the end it doesn't really matter which citation system you use as long as you're consistent.
Cheers
Thanks for this report.
Of course you are right that in case of pages the problem is obscured by abbreviation. Furthermore, I'm not sure about csl or xml in general, but I have had some experience with gettext files (translating .php themes using built-in gettext engine) and it seems that it handled Slavic numerals just fine, using some sort of matematical formula that defined when to use which form. A great example: a comment in Polish is "komentarz". In English the plurals are formed by adding s. As simple as it is: 1 comment, 2 comments, 3 comments and so on. In Polish (and most if not all Slavic languages), there are different endings for 1, 2 to 4 and 5 to infinity. Using gettext I was able to point php files to a correct form. Take a look at any Polish-language Wordpress-based blog: it features a counter saying 1 komentarz, but 2 komentarze and 10 komentarzy. Something similar could be achieved in Zotero if need be.
But is it really needed? IMO in the case of singular-plural for pages - no. However, in the case of, say columns - there is a problem. The abbreviation "kol." is already reserved in Polish for "kolega" (meaning colleague). So we can't create artificial abbreviations, we should use either "kolumna I" (column 1), kolumny II-III (columns 2-3) or "na kolumnie 15" (at column 15), depending on the context and citation style. Not a perfect example, but I can't think of any better right now, sorry.
As to genders, inflection and such, computers are English-centric, so short-cuts are acceptable in modern Polish publications (such as "zaprezentowano" I mentioned before: it could be translated as "was presented", without specifying the subject ("it") as it would require a gender-specific suffix). Another example of such short-cut used in citations is the term for access date (as in "this web-page was accessed on January 2nd, 2003"). It should be "udostępniono/udostępniona/udostępnione" (gender and numer-specific forms), but "dostęp" (literally: access) is often used. It is acceptable nowadays, which doesn't make it look less awkward though.
Oh, just noticed one more problem with English vs. ROTW: what has only one name in English could have many context-dependent names in other languages. Take the word "book" for instance. Normally (in the case of "book as a reference" instead of "chapter" or "web page"), it should be translated as "książka". However, when talking of, say, Book of Job from the Bible, or Book VII of Iliad, the correct term is "księga". So what is one string for the English translation, should actually be two strings for the Polish, both context-dependent.
As to short forms - do you mean "Jan, Feb, Mar, Apr," and so on? Yup, these are used, but I can't remember seeing such abbreviations anywhere outside of calendars. Certainly not in academic citations. When by "short form names" you mean names in nominative case, then Polish does use them. I believe I already mentioned (here or in the other thread), the reference could be:
Doe, John (1 stycznia 1945) "Blahblahblah" - the month name is in Accusative
but
Doe, John (styczeń 1945) "Blahblahblah" - the month name is in Nominative
The latter case is fine as long as the day is missing. So in theory the latter form is needed. In practice one rarely see such references in Polish publications. Partly because we have no tradition of citing daily publication date for anything except for daily newspapers and weeklies, usually year is enough. And parly because if it is a monthly, its' number is enough to tell which month it was.
@fbennett - sure, I figured I could hard-code Polish usage into the files. So my personal Chicago is no longer compatible with English usage (01.02.2003 date format instead of 1 February 2003, for instance), but it works fine for my needs. One has to bear in mind however that translating Zotero into different languages is a tad more complicated than "take one term and check its' meaning in Polish".
Cheers
http://validator.nu/?doc=https%3A%2F%2Fgist.github.com%2Fraw%2F1064418%2F29a17654a6d7c24b42c2ae16cfb2b117a544e2a3%2Flocales-pl-PL.xml&schema=https%3A%2F%2Fgithub.com%2Fcitation-style-language%2Fschema%2Fraw%2Fv1.0%2Fcsl.rnc&parser=xml&laxtype=yes&showsource=yes
Also, some of the problems seem to be related to the fact that Zotero tries to be less English-centric than the CSL. Hence the document has fields for more ordinals than 4, while CSL document lists additional ordinals as errors. Just remove the lines after
"term name="ordinal-04" and it should validate nicely. As long as Zotero uses ordinal-04 the English way (adding th to anything above 4), it should work just as well.
Anyway, I'm not an expert on xml or coding or whatnot, just an educated guess. From my perspective the document is okay as it works, which is all that matters to me :)