UTF-8 conversion error when importing HTML
Fortunately, most HTML is nowadays served as UTF-8 by default, for example anything generated by the Drupal CMS. When I try to paste parts of such an HTML page into Zotero, diacritical marks get converted by default (see the screenshot at http://www.flickr.com/photos/13771656@N04/5615737314/). Is there a setting that I have forgotten to adjust, or is this a bug?
But in any case, copying that same line into a note works fine for me—no HTML entity—on OS X.
Edit: I wrote ASCII, I meant HTML encoding.
Edit: Specifically I'm referring to ' being turned into '
Edit: Heh, that's funny, the forum is doing it backwards. &-apos; without the dash then.
(Last) edit: This is Zotero 2.1.1 on Firefox 4 on Windows 7 SP1 (x64).
From Wikipedia's UTF-8 article: A straight quote—the apos entity in HTML and XML—is just an ASCII character.
And it's not being "translated by zotero to ascii" (in which case it would still just be an apostrophe, but extended characters would be mangled). It's being converted to an HTML/XML entity. [Edit: OK, you corrected your post.] But again, I can't reproduce this.
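To make the distinction concrete, here is a minimal Python sketch. It is not Zotero's code, and the sample string is made up for illustration. It assumes that "translated to ASCII" means a lossy transcode and that "converted to an entity" means XML-style escaping with `&apos;`.

```python
from xml.sax.saxutils import escape, unescape

# Hypothetical sample text: a straight quote plus two accented letters.
text = "l'\u00e9t\u00e9"

# ASCII transcoding: the straight quote survives, the accented letters are lost.
as_ascii = text.encode("ascii", errors="replace").decode("ascii")

# Entity escaping: the quote becomes &apos;, the accented letters are untouched.
as_entity = escape(text, {"'": "&apos;"})

# Reversing the escaping restores the original string.
round_trip = unescape(as_entity, {"&apos;": "'"})

print(as_ascii)
print(as_entity)
print(round_trip)
```

The behaviour reported in this thread looks like the second case: the apostrophe is escaped, and nothing undoes the escaping before the note is displayed.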
Still, I'm curious if it may be a setting then. I haven't messed with the default configuration.
Does anyone else on Windows & Firefox 4 want to give it a go? I keep getting it with all other add-ons disabled on two separate machines. To reproduce, select some text with a straight quote on the linked page, option/alt-click/tap it, choose Zotero, and then choose (your English translation of) "Zotero-object en -aantekening maken van selectie" (roughly "Create Zotero item and note from selection").
Using the context menu option, I can reproduce the problem. We'll look into it.