Separate fields for Title and Subtitle
Currently, Zotero provides only one field for the title of the work. But often, a title consists of a main title and a subtitle. I think Zotero should provide separate fields for both.
Why is this important? This is mainly important for consistent output regarding punctuation. In German, it is common to use a period between the main title and the subtitle. In English, I think a colon or a semicolon are used. This is also brought up in this thread. There, it is recommended to do more parsing on certain fields. Anyway, one would come up with two separate fields in CSL. I think having two separate fields in the data entry mask is easier than parsing title strings.
For some bibliography styles, it is important to have control over the formating of the title/subtitle. MODS already has separate fields for main title and subtitle, as does biblatex.
Cheers,
Frederik
Why is this important? This is mainly important for consistent output regarding punctuation. In German, it is common to use a period between the main title and the subtitle. In English, I think a colon or a semicolon are used. This is also brought up in this thread. There, it is recommended to do more parsing on certain fields. Anyway, one would come up with two separate fields in CSL. I think having two separate fields in the data entry mask is easier than parsing title strings.
For some bibliography styles, it is important to have control over the formating of the title/subtitle. MODS already has separate fields for main title and subtitle, as does biblatex.
Cheers,
Frederik
Also, both CSL, Zotero, and BIBO (Zotero's preferred import/export format) all have support for a notion of a "short title", which is typically the title sans subtitle.
Personally, I always replaced the colons from imports with periods. But this does only work until you don’t publish in English journals, where the colon is common. I stumbled upon this issue since my boss just told me that this was the main stopper preventing him from switching to Zotero. So I thought I’d bring it up here. And that other thread shows that at least one other person already came up with it. Most of them are rather old formats. They are well established, but don’t cover all use cases, especially from the humanities. MODS is quite complex, but also very feature-complete. And biblatex (and other BibTeX-derivatives) was invented to overcome some limits of plain BibTeX which is rather a least common denominator. But it doesn’t help you to format the delimiter. And it doesn’t always hold true: In history and related disciplines, in subsequent citations, a short title is used which is typically shorter than the main title (Example). You could, for example, have »Religionsgeschichte Deutschlands im 19. und 20. Jahrhundert« as the main title, »Religionswissenschaftliche Überlegungen« as the subtitle, and »Religionsgeschichte Deutschlands« as the short title.
here a couple of citations from a recent article (Höpner, M.; Jackson, G. Das deutsche System der Corporate Governance zwischen Persistenz und Konvergenz. KZfSS Kölner Zeitschrift für Soziologie und Sozialpsychologie 2002, 54, 362-368).
There is no pattern here. Seems like they are OK with whatever the original title is. Which seems eminently reasonable to me.
Jackson, Gregory, 2001: Organizing the Firm: Corporate Governance in Germany and Japan, 1870–2000. Dissertation, New York, NY: Columbia University.
Jong, Henk Wouter de, 1997: The Governance Structure and Performance of Large European Corporations, The Journal of Management and Governance 1: 5–27.
Kurdelbusch, Antje, 2002: The Rise of Variable Pay in Germany. Evidence and Explanations. In: Anthony Ferner (Hg.): Special Issue of „European Journal of Industrial Relations“ on Multinational Companies and Globalisation (im Erscheinen).
Thelen, Kathleen, 2002: How Institutions Evolve: Insights from Comparative-Historical Analysis. In: James Mahoney und Dietrich Rueschemeyer (Hg.): Comparative-Historical Analysis: Innovations in Theory and Method. New York: Cambridge University Press (im Erscheinen).
Thelen, Kathleen, und Sven Steinmo, 1992: Historical Institutionalism in Comparative Politics. S. 1–32 in: Sven Steinmo, Kathleen Thelen und Frank Longstreth (Hg.): Structuring Politics. Historical Institutionalism in Comparative Analysis. Cambridge, MA: Cambridge University Press,.
Wójcik, Dariusz, 2001: Change in the German Model of Corporate Governance: Evidence from Blockholdings, 1997–2001. Oxford: School of Geography and Environment, University of Oxford (unveröffentlicht).
Maybe this example helps: http://www.hendrik.maekeler.eu/zitierrichtlinien.de.htm The last sentence states: »A period is put between main title and subtitle.« Short cites contain only the first meaningful noun, which I guess would count as a short title.
We're all keen to see title/subtitle correctly supported for German publications; the question is just how best to go about it. As Bruce points out, some pretty sweeping changes would be necessary to split titles into two fields, and then correctly handle the result. Titles from most repositories would need to be correctly split (i.e. when the translator is run to capture the cite data), and during manual entry, users would need to use the two fields correctly. The change might actually make things less reliable overall, rather than more. It would certainly mean more typing for everyone.
If a single title field can be split reliably into the correct two parts by the CSL processor, we can accomplish the same result (i.e. printing a period between title and subtitle in German styles, and a colon in English styles), without any changes to the database or to Zotero -- and fewer changes to existing kit means an earlier solution.
At the moment, Zotero doesn't do this, so to cope, you have curated your database with periods between the two elements. Many other German-speaking users will have done the same. It might be that a colon provides a more reliable hint that the field should be split, but let's start by looking at whether there are titles that _cannot_ be split correctly by breaking on the first period. Looking at your database, are there any such titles?
If you can identify "difficult" titles, let's see if we can work out a way to handle the split correctly. If that can be settled, the next step is easy -- we'll add the title delimiter issue to the todo list for CSL development, and once it's been added to the CSL specfication, I'll add the necessary support to the upcoming CSL processor for Zotero.
Meanwhile, please tell your boss that we're interested in resolving this issue for German users, but that splitting the title across two fields in the database mask might not be part of the solution.
Thanks for this additional info. There seem to be journals out there that require the title/subtitle delimiter to be localized. One of the devs wrote in 2007: So there is at least anecdotal evidence that this is a real issue.
thanks for your reply. I fully understand that introducing a new field has quite much impact, and so it might not be worth the trouble. I thought that having separate fields is an easier solution in the end, and that it also could be used optional (if one puts everything into the title field, it doesn’t matter as long as you don’t need the features discussed here). But I might be perfectly wrong, it was just my view on the issue.
And I think that it is more valuable to have a well-thought-out solution in the end, than making quick changes which might turn out as inappropriate. So everything I write is pure brainstorming, everything can be dismissed in the end.
What I think should be considered when trying to solve this issue:
- It might not only be about punctuation. I have seen cases where, e.g., the main title is printed in italics, while the subtitle is printed normal. This is probably only an issue for strange German traditions, but one might consider this.
- A title might have more than main title and subtitle, but also a title addon. This is the case for Festschriften. (see this example). There you have the main title (Das Auge Des Betrachters), a subtitle (Beitrage Zum Konstruktivismus) and a title addon (Festschrift für Heinz von Foerster). This is not so uncommon in German publications.
I skipped through my library, and it seems that splitting at the first period is in deed problematic in some cases, e.g., abrreviations, like this: »Immigrant Religion in the U.S. and Western Europe: Bridge or Barrier to Inclusion?«. Splitting at the first period would split after U. Considering period-space as a delimiter, one would split after U.S. In this case, one could split after a period only if the title contains no colon, but what for German titles with abbreviations and periods as delimiter?Also, question marks can be used as delimiters, as in »Ghettos oder ethnische Kolonien? Entwicklungschancen von Stadtteilen mit hohem Zuwandereranteil«, but I think this is also true for English resources and should be less problematic.
But I think it would be better to migrate the library once to using only a colon as a delimiter than having an unreliable solution. But this means that it must be known that the colon has a special meaning, and that one should not use a period in titles. That relates to the issue of how one could teach people about special »micro-syntax« used in Zotero’s fields for parsing, which was raised in the topic I linked in my first post.
I hope I didn’t come up with too much to think about. Just ignore whatever you consider irrelevant with regard to the original issue.
Cheers,
Frederik
I would also be strongly against a feature that requires users to know that colons have some special meaning - that sounds like a road to user unfriendliness.
If we need to inconvenience many users to accommodate very few that would seem like a bad idea. If anything, those very few users who really rely on that feature should bear the 'burden' - e.g. use some _very_ uncommon string (#:# or so) to delimit Title and Subtitle - this would also currently be possible and then just require a find&replace at the end...
If we have the situation that some styles require the delimiters to be normalized, while some require the original delimiters to be used, then I think we end up at the very edge of Zoteros localization capabilities. This has to my knowledge already been discussed for quotes: The very idea of quoting in CSL was that quotes should be localized to the current locale (I’m not sure if this is already implemented).
But there are styles which require German titles to use German quotes, and English titles to use English quotes. The same seems to hold true for some styles and title delimiters. This would require language-awareness on a per-title basis. That was discussed for quotes but rejected as being too complicated to implement.
So if one wants serious localization capabilities which get many of the even complicated corner-cases, one should probably stick to LaTeX/biblatex. But it would be nice if Zotero could be used to manage one’s collection and bibtex files only be exported when actually writing papers. This does, in turn, require that Zotero is capable of exporting various BibTeX dialects, including biblatex, which requires to separate parts of the title. So there we are again.
My basic criteria for adding features to CSL is:
- is the feature really needed?
- is it easy enough to implement in both CSL syntax and in CSL-based processors?
This isn't exactly the classic "80/20" rule that developers often cite, but it's similar.Multiple-localization per document (not just of titles, but also of quotes and dates), to my mind, breaks the second rule, and is marginal enough that it's not clear that it satisfies the first.
So if there is a trade-off between the two, I think we'd want to go with the one that keeps CMOS correct and intact.
So probably one could live with something along these lines:
- If the title contains a question mark, split at the (first) question mark.
- If the title contains a colon, split at the (first) colon.
- If the title contains a period, split at the (first) period.
The only corner case would be titles which contain abbreviations and use periods as delimiters. But I think it is ok to change these to using a colon instead.For the very small number of remaining failures, would it be permissible to use backslash escapes on individual characters (Frank ducks)? In felwert's example above, changing U.S. to U\.S\. (would be ugly in the UI but) would provide a workaround for exceptional cases, without breaking the data for CSL purposes. Backslash escapes could be stripped (or a stripped copy of the field could be stored separately) for data exchange or record comparison. If that's too ghastly, we could just leave the edge cases to be touched up by hand.
@felwert: Exactly, that should work. Then in the CSL markup we can treat the two (or three) sides of the split as separate virtual fields, where discrete formatting or selective presentation is required.
@Bruce: What's the "classic '80/20' rule"? My only CS training was a course in Symbolic Language for the IBM 1401 that I took three decades ago. Our instructor must have skipped over that one.
For CSL, every feature we add, and every corner case we decide to cover, means additional work for implementers, and so less likelihood we'll see libraries finished (which is already a problem), and in turn applications built on top of them.
http://forums.zotero.org/discussion/1798/
In terms of the title/subtitle problem, more data entry but less processing logic. However I don't think it would handle all the issues with localizing quotes etc., unless you entered every sort of variant (which implies you're not necessarily entering multiple languages but just multiple ways of rendering the title, author, etc.).
I've always thought "short title" would be the place to put abbreviation of the work common in the field you work in. E.g. TESZ.
I know there is a "journal abbreviation" field for journals, but where would I put a similar abbreviation for standalone work, if not "short title"?
Something like this:
...
Robertson, A. 1997: "Xxxxx xxx", ...
TESz: "T.... E.... Sz... " (XY edt.), ...
Uhlbeck, P. 2001: "Yyyyy yyy", ...
...
Zotero should work in a way that if you have the correct data you get the correct citation output. That's really all it can do - but I do think it should try to come as closely as possible to that ideal.
For an end user that may mean occasional fixing of items in the database that have, e.g. been poorly imported - but that is a one-time operation - much preferable to doing the same thing every time I cite the same source.
And sure, people should proofread their stuff - but I think the fewer manual editing necessary the better, no?
For full note citation, in the subsequent cites, it is very annoying to have a very long title.
It often happens when books have subtitles, but not only.
The notes field cannot be a good place for the short title. Notes are used for supplementary informations.
They appear in the bibliography part, but not for the citations one.
It' s precisely for citations, subsequent part essentially, that there is a need for short titles.
So I would very much appreciate to be able to use the short title field.
Besides, I wondered whether it would be possible to have a parameter ("et-al" alike) to fix a maximum number of characters, followed by a localized string ("...") ?
If anyone has a solution for writing shortened titles, thanks for letting know it
<text variable="title" form="short"/>
That should print the short title.But this is not really related to the issue of having main and subtitle separated.
The question concerned CSL styles.
I'd coded this way, but it didn't seem to work. I must had done mistakes somewhere at that time.
After reading your message, I filled some short-titles fields and verified it at once, and could see that my style worked all right!
Since I'd seen in Zotero Metadata Field that there was no CSL fieldname for short title, I was convinced that it was not possible to use it for the moment, I left the code lines (for the future), but didn't go on filling short-title fields !
I posted my question in this discussion, for it was related to long titles.
I support the idea of having more fields, as separate main title and subtitle, but automatism is dangerous, for there are too many "standards", and they are changing all the time.
Even if have nothing to change in my style, your answer helps me a lot.
Thanks again. Excuse me for the disturbance.
- Contra CM6's claim above, Chicago stipulates that English-language publications need not follow European conventions for subtitles. See CMS 14.97-98. Indeed, many presses insist that their own national conventions be observed on this point, even for citations of material published in a different linguistic tradition. In my experience this line is followed by most presses, nationality aside. And those presses that don't are eclectic in other places where they clearly conflict with Zotero's policy. E.g., they publish bibliographies with a mix of punctuation devices: chevrons around French article titles; American quotes around American-published titles; single quotes around UK-published titles; upside-down opening quotes for German titles, etc.
- TEI recognizes the standard. The element <title> includes as recognized values for the attribute @type both "main" and "subtitle".
- Zotero has always pursued a policy of trying to get out of the fields anything pertaining to metainformation. The punctuation and spacing separating title from subtitle are elements of metapunctuation; they are not intristic to the titles themselves. This is demonstrated by the great number of books that omit on their cover or title page any punctuation delimiting title from subtitle. If the colon or period were really a part of the title, it would appear everywhere all the time.
- Capitalization of a subtitle or subsubtitle is oftentimes dictated by style concerns. Zotero users should not be forced to declare whether a (sub)subtitle should or should not begin with a capital letter. That's a style call.
- Short titles cannot be used for this purpose, since that field is frequently being used for other purposes (such as taming the length of an overly long main title).
- I work at a press that has a regularizing policy, and a number of my authors have resisted or fought our press's editorial policy on this point. Those same critics have usually not considered the important distinction between metapunctuation and intrinsic punctuation. This should be a non-issue. Publishers, like Zotero, are leaning toward publication models that allow readers to view bibliographic data as they wish. The only way to effectively employ this strategy, however, is if main and subtitle fields can be distinguished. Then those who dislike colons can be as happy as those who prefer them.
I'm willing to go further, and say that we need not only a subtitle field, but a sub-subtitle and a sub-sub-subtitle field. Yes, I have seen books with four levels of titles. They are usually horribly edited and dreadful to read. But there they are.Someone may counter that sometimes one needs to record in a Zotero record what national convention has been followed. That is easily accommodated by faithfully keeping the language field up to date.
To those who suggest that such a fix would be a heck of a lot of work, I would encourage a long view of things. It doesn't need to happen now. But it needs to happen eventually. The compelling need is already starting to build up a head of steam as more and more publications in the humanities (where people are passionate about punctuation) take to the Web. Developers should plan accordingly.
See http://forums.zotero.org/discussion/15636, and the concrete proposals and their status at https://github.com/ajlyon/zotero-bits/issues .
I will add that many site translators currently concatenate titles and subtitles that are distinct in the source site to get the present content of the title field. So this would match the structure of much of our source data (including MARC and MODS, as has been noted).
I'm pretty sure, however, that multiple levels of subtitles are not going to happen any time soon.