Localized "and"
Hi,
I know I can localize the text of the term "and", but is there a way to set a localized "symbol" for "and" (i.e., not "&")? If not, may I set different ("short/long") versions of the term?
I'm asking because common Chinese citation practice (here in Taiwan at least) prefers to use a special punctuation mark (a "light comma" with no direct English equivalent) instead of the "&" symbol to separate two authors. Currently I use <term name="and"> for "light comma", but that takes away the ability to have a true Chinese term for "and".
I've experimented with <term form="symbol" name="and">, <term form="short" name="and"> and <term form="long" name="and"> to no effect. Am I doing anything wrong?
I know I can localize the text of the term "and", but is there a way to set a localized "symbol" for "and" (i.e., not "&")? If not, may I set different ("short/long") versions of the term?
I'm asking because common Chinese citation practice (here in Taiwan at least) prefers to use a special punctuation mark (a "light comma" with no direct English equivalent) instead of the "&" symbol to separate two authors. Currently I use <term name="and"> for "light comma", but that takes away the ability to have a true Chinese term for "and".
I've experimented with <term form="symbol" name="and">, <term form="short" name="and"> and <term form="long" name="and"> to no effect. Am I doing anything wrong?
To answer your question: no, you can't localize the ampersand and it doesn't have a short or long form, although you can, obviously, localize the text "and" to anything you want, including a symbol.
This works for most cases, but when we have something that needs both "versions" of "and", we have a problem. Consider the following example, taken from the MLZ's CSL-m spec page (http://citationstylist.org/docs/citeproc-js-csl.html#affiliated-authors):
Clarke, Ministry of Fear and Smith & Brown, Large Corporation
Ideally, I would like to have the "light comma" in place for the ampersand, with a Chinese equivalent of "and" in place for the word "and". Currently the "light comma" is used in both places, and it's not pretty. (Imagine using ampersand in place of "and" for the example above.) Guess I have to live with it for now though.
edit:
in other words (though not having tested this), I'd expect
<names variable="author">
<name delimiter="; " initialize-with=". "/>
<institution and="text" delimiter=", ">
</names>
to return
Clarke, Ministry of Fear and Smith; Brown, Large Corporation.
(where I've chosen the semicolon to stand in for you special light comma).
Many Thanks. You have been amazingly responsive already. I wish I could code like you.
adamsmith:
Thanks for the suggestion. It doesn't work here, unfortunately. I'm using MLZ (newest version), not Zotero, if that makes a difference.
On 2nd try, indeed your method successfully replaced the ampersand with the delimiter. The problem remains, however, for I do need the regular comma. Using the same English example as the above, we get the following with your method:
S. Clark; Ministry of Fear and R. Smith; T. Brown; Large Corporation
See? The semicolon would take not only the ampersand between Smith and Brown, but also the comma between Clark and Ministry as well as that between Brown and Large. That's not acceptable, I'm afraid. Thanks for showing me how to avoid "and" altogether though. Appreciated.
葉俊榮,台大法律系與林子儀、簡資修,中研院法律所
* color code:
blue: personal names
green: institutional names
red: Chinese equivalent for the word "and"
orange: the "light comma" I keep talking about
black: Chinese comma.
** A Chinese comma is a "comma" taking the space of one Chinese character. (Chinese fonts are non-proportional in nature.) It can be substituted with ", " (regular comma followed by a space). The same applies to other Chinese punctuation marks with direct English equivalents, including the round brackets used below.
Screenshot for those without Chinese fonts installed:
https://docs.google.com/file/d/0B3F1SeOjlQYqQWF1WDh5aEEtT1E
For interviews (Bluebook R.17.2.5), it should be:
葉俊榮(台大法律系教授)與林子儀、簡資修(中研院法律所所長)訪談紀錄
* color code: brown - a Chinese expression meaning "interview record".
Again, screenshot at:
https://docs.google.com/file/d/0B3F1SeOjlQYqNm1DRTNCZmxzNTA
I'm putting the interviewees' title inside a pair of brackets because
1) we don't have a proper way to start the cite as "Interview with",
2) with the Chinese expression of "interview record" following the name list, there can be no delimiter in between, lest the cite becomes a document with the title "interview record" authored by those interviewees; and
3) since there's no comma separating the name list and the Chinese words for "interview record", it would be odd to use a comma between interviewees and their title.
One last note: there is really no standard citation rules for Taiwanese legal community. Each law journal has its own rules, but usually very sparse, covering usual suspects only, and often poorly enforced. I've never seen a rule for interview, let alone one for complicated multi-affiliated authors like we're discussing here. I'm therefore making up my own rules, trying my best to make these rarely used cases blending in well with the normal ones.
Coincidentally, I'm currently working a Japanese style for our faculty journal 『法政論集』. The situation here is similar to Taiwan, with loose conventions and sparse guidance documents. The main reference seems to be this 2004 document, which covers only the main item types.
I'm working through some basic forms right now. I'll try to build a test later today, and work out what best to do with mixed-element name listings.
<term name="and" form="symbol">、</term>
Using and="symbol" on cs:name or cs:institution will call the localised form.
This is form MLZ only at the moment: localisation of the "symbol" form of "and" is disabled in Zotero proper.
The version info. on the download page should be updated though. It still says [Ver. 4.0m398, 14 Oct 2013].
<term name="and" form="symbol">、</term>
, right?
The issue to watch would be with styles that redefine the unadorned form of the term:
<locale>
<terms>
<term name="and">und</term>
</terms>
</locale>
This happens in 23 repository styles. On the specification, it should affect only the default long form, and leave the other forms untouched (i.e. "or specific form of a term"). If that's not strictly honored (if the bare term definition wipes the slate clean for the target term) it would disable the symbol form of the term, which would be a problem.
I just checked, and citeproc-js conforms to the specification, so adding the symbol form and relying on it would be fine here. It might be worth checking with other processor developers to be double-plus-sure. It seems unlikely to be a problem, since over-aggressive redefinition of locale terms would probably cause issues in other contexts as well, but just in case.
In the meantime, if you haven't already done so, could you create a unit test to verify that defining the "symbol" form of the "and" term in cs:locale of a style works?
https://bitbucket.org/fbennett/citeproc-js/src/6991e152732e83dde218c06df3a3236caf6471c4/tests/fixtures/local/multilingual_LocalisedNameDelimiters.txt?at=default
Can be moved to the CSL test repo after the spec amendment goes through.
I'm asking because it seems like the "and" term is wrapped in hard-coded spaces in English.
Apple
תפוח
Orange
תפוז
Apple and Orange
תפוח ותפוז
For name spacing generally, it's suppressed according to the charset.
Do I understand correctly that we should add the & #8200; for the Hebrew locale as well?