I've notice that Zotero has a problem with polish letter "ł/Ł"
Everytime i import anything with that letter it saves it as /l or /L. Does anyone else have a similar problem? I've tried reinstalling Zotero, but without any results.
I have had no problem importing those characters from publisher sites nor from a database such as PubMed. There have been a few problems with importing (or copy/paste) from WorldCat.
1. https://scholar.google.com/scholar?cluster=249138857392321717&hl=en&oi=scholarr
2. https://www.researchgate.net/publication/322765059_Orientacje_spoleczne_a_plec_Czy_rywalizacja_to_tylko_meska_rzecz
3. https://www.ceeol.com/search/article-detail?id=283274
4. http://www.depotuw.ceon.pl/handle/item/1014
I start to believe it is my computer/browser fault.
How about the Wikipedia article? Same issue?
Because of the way that GS compiles the item list (mostly by capturing online versions of the publications themselves but also by grabbing citations with OCR from article reverence lists) the characters can be rendered in a peculiar way.
Getting each item separately is possible but it's not a system solution to this problem. It doesn't match my workflow in which I want to build problem-focused bibliographies quickly. Crossref gets ł right, but it doesn't recognise all records from GS.
I may be wrong about this but Zotero seems to do a really good job of recognizing and correcting character encoding even when it is different from what is declared in the page header. "Funny" characters displayed on the screen and in the webpage headers can miraculously be correctly captured by Zotero translators.
The problem with Google Scholar is that the records do not necessarily have the same character encoding scheme even within the same record.
The way that GS populates its database, by uncurated capturing of its contents from across the web, will cause errors. I could go on a long-winded further explanation of the problem with character encoding and how, from experience with my own online bibliographic database, even some publishers' websites get it wrong online and in the metadata they provide to databases but this isn't the place for that.
A speed-priority workflow especially that based on uncritical collection of raw metadata from GS will have not only errors due to character encoding -- it will also have volume, issue, and pagination errors. It will have omitted author names and authors in the wrong order. Then, if you openly share your bibliography online, you will reinforce the error when GS scans your site -- further compounding the problem.
This has nothing to do with character encoding. Zotero uses BibTeX output from GS, which consistently renders ł as
{\l}
I assume that's correct and we just don't have this in our tables, but even if it's incorrect BibTeX, it's a simple fix on GS import, we'll take a look.
Specifically, here's what the GS translator gets as the BibTeX:
@book{derbis1998poczucie,
title={Poczucie jako{\'s}ci {\.z}ycia a swoboda dzia{\l}ania i odpowiedzialno{\'s}{\'c}},
author={Derbis, Romuald and Ba{\'n}ka, Augustyn},
year={1998},
publisher={Stowarzyszenie Psychologia i Architektura}
}
And here's what the BibTeX translator conferts that to (which is essentially what is being used as the title at the end of the day):
@book{derbis_poczucie_1998,
title = {Poczucie jakości życia a swoboda dzia{\textbackslash}lania i odpowiedzialność},
publisher = {Stowarzyszenie Psychologia i Architektura},
author = {Derbis, Romuald and Bańka, Augustyn},
year = {1998},
}
OTMH I don't know whether that {\l} is correct or not, if yes, then it must be the BibTeX translator, which does a lot of low-level parsing inside.
no idea why we didn't fix this 6 years ago when it first came up.