UTF-8 conversion error when importing html

Fortunately, most html is nowadays served by default as utf-8, for example anything generated by the drupal cms. When i try to paste parts of such an html-page to zotero, diacritical marks get converted by default (see the screenshot at http://www.flickr.com/photos/13771656@N04/5615737314/). Is there a setting that i have forgotten to adjust or is this a bug?
  • edited April 13, 2011
    First of all, at least as far as the example in your screenshot goes, this has nothing to do with UTF-8 or diacritics—that's just a plain ASCII straight quote.

    But in any case, copying that same line into a note works fine for me—no HTML entity—on OS X.
  • edited April 13, 2011
    I'm sorry, you misunderstand. The text is provided through the html as utf-8. Within the code it is written as '. On add (select, context-menu, add to library) it is translated by zotero to ascii. So maybe it's a windows thing?

    Edit: I wrote ascii, i meant html-encoding.
    Edit: Specifically i'm referring to ' being turned into '
    Edit: Heh, that's funny, the forum is doing it backwards. &-apos; without the dash then.
    (Last) edit: This is zotero 2.1.1 on firefox 4 on windows 7 sp1 (x64).
  • edited April 13, 2011
    UTF-8 is irrelevant for ASCII characters—they use the same code points.

    From Wikipedia's UTF-8 article:
    In particular, the first 128 characters of the Unicode character set, which correspond one-to-one with ASCII, are encoded using a single octet with the same binary value as the corresponding ASCII character, making valid ASCII text valid UTF-8-encoded Unicode text as well.
    A straight quote—the apos entity in HTML and XML—is just an ASCII character.

    And it's not being "translated by zotero to ascii" (in which case it would still just be an apostrophe, but extended characters would be mangled). It's being converted to an HTML/XML entity. [Edit: OK, you corrected your post.] But again, I can't reproduce this.
  • :-) You reply really fast. Thanks for setting me straight.
    Still, I'm curious if it may be a setting then. I haven't messed with the default configuration.

    Does anyone else on Windows & Firefox 4 want to give it a go? Ik keep getting it with all other addons disabled on two separate machines. Just select some text with a straight quote on the linked page, option/alt-click/tap it, choose Zotero and choose (your english translation of) Zotero-object en -aantekening maken van selectie to reproduce.
  • edited April 13, 2011
    Oh, wait—I missed above where you said you were using the context menu option. I thought you were just copying and pasting into a Zotero note.

    Using the context menu option, I can reproduce the problem. We'll look into it.
  • OK, fixed in the latest dev build. The fix will be included in 2.1.6. Thanks.
  • Awesome, thanks. Should anything else ever come up, I'll try to think before i post in order to be a bit more specific.
This discussion has been closed.