PDF indexing issue: Korean text not searchable
Hello,
I've recently encountered an issue with PDF indexing. Until last week, I was able to search for Korean text in indexed PDFs. However, Korean words are no longer searchable after rebuilding the index - only English words appear in search results.
Here's what I've tried so far:
- Checked that my PDFs contain selectable text
- Reinstalled and updated to the latest Zotero version
- Cleared and rebuilt the index
Any help would be appreciated.
Thanks
We'll work on fixing this, but you will likely encounter the same issue if you paste Hangul Jamo into Zotero notes, as they also cannot be indexed.
How common are those PDF files?
But I believe these are two symptoms of the same problem, so I am leaving this here.
With every Korean PDF file I tested (created with Hancom HWP, MS Word, etc.), copied text pastes as decomposed Korean jamo in Zotero 7.
The same files paste correctly from Adobe Reader now, and they never had this problem in Zotero 6.
I don't think fixing indexing and copy-pasting separately is the right approach. Please look into how PDFs are handled.
https://s3.amazonaws.com/zotero.org/images/forums/u16515968/v1vxcf75y9mkkj9b4o1e.png
https://s3.amazonaws.com/zotero.org/images/forums/u16515968/i14bh7fqcgtz09ps2tge.png
https://s3.amazonaws.com/zotero.org/images/forums/u16515968/ted2n7hvfivm7qope0km.png
https://s3.amazonaws.com/zotero.org/images/forums/u16515968/tbytf8aafmkb41t7smbr.png
file source: https://data.kostat.go.kr/nowcast/newBigDataBrdMgr.do?boardId=1&menuId=4&subMenuId=1&isPopup=Y
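For anyone following along, here is a minimal Python sketch of why decomposed jamo would break search. It is not Zotero's code. It assumes the text extracted from the PDF arrives as conjoining jamo (the same form Unicode NFD produces), which is my reading of the screenshots above rather than something confirmed. The helper names `matches` and `normalized_matches` are made up for illustration.

```python
import unicodedata

# Two precomposed Hangul syllables, as a user would type them in a search box.
composed = "한글"

# The same word as conjoining jamo (assumption: this is roughly what the
# PDF text layer yields when syllables come out decomposed).
decomposed = unicodedata.normalize("NFD", composed)


def matches(query: str, text: str) -> bool:
    """Plain substring search with no normalization."""
    return query in text


def normalized_matches(query: str, text: str) -> bool:
    """Substring search after composing both sides to NFC."""
    nfc_query = unicodedata.normalize("NFC", query)
    nfc_text = unicodedata.normalize("NFC", text)
    return nfc_query in nfc_text


print(len(composed), len(decomposed))            # code point counts differ
print(matches(composed, decomposed))             # raw comparison fails
print(normalized_matches(composed, decomposed))  # succeeds after NFC
```

If that assumption holds, normalizing the extracted text once, before it reaches either the indexer or the clipboard, would address both symptoms in one place.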