Issues with Copying Norwegian Character "å" from PDFs

When I copy text from this PDF (and other PDF files in norwegian), the letter "å" is not copied correctly (see image 1 below where the error is highlighted in yellow). As you can see, the correct character is copied when using Firefox (4), but errors are made in Zotero 7 (Windows) and Zotero web (in MS Edge).

https://s3.amazonaws.com/zotero.org/images/forums/u6115020/94sxb0kcqax5ow3w65aa.png

Picture 2 below shows the text in font Calibri, where "å" apparently looks correct, but is not recognized as an "å" by the spelling function in Word/OneNote.

https://s3.amazonaws.com/zotero.org/images/forums/u6115020/bw40z6sic5dbz7dg0pmr.png

Does anyone know what the cause of this error could be?

Character copied from Zotero: å
  • It's the same character. Those are just different normalized Unicode forms — one precomposed character (NFC) and one that's the letter "a" plus "combining ring above" (NFD). That seems like a bug in the Courier Prime font (which I see as well, and don't see in Courier New), and I guess Word and OneNote aren't normalizing for spelling.

    For what it's worth, Zotero on macOS seems to be copying as NFC and Zotero on Windows seems to be copying as NFD. We'll have to check whether there's a good reason for that. But both forms are valid, and the font is the problem here.
  • Ok, thanks for the answer.

    Hope there is a way to fix this in the future. It is mildly slow impractical to feel this way.

    PS. as I wrote, in word, regardless of font type, there is a red underline in all words, copied from Zotero, which contain "å".
Sign In or Register to comment.