How to read PDF metadata with GBK encoding?
I find that PDFs written in Chinese are often not encoded with the default UTF-8 method, so reading their metadata fails for lack of usable information.
Would you please provide a way to customize the pdftotext command, so that we can manually support GBK encoding? Once we can build the correct parent item, we can think about how to support both UTF-8 and GBK encoding.
By the way, I once noticed that the bundled versions of pdftotext and pdfinfo were updated to 4.0beta but later went back to 3.0.2. I think version 4.0 has been out for a long time.
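To illustrate the kind of manual workaround I have in mind, here is a minimal sketch in Python. It assumes a pdfinfo binary on the PATH that accepts the -enc option and has a GBK encoding available (with Xpdf this normally needs the Simplified Chinese language support package, with Poppler the poppler-data package). The function names are my own and hypothetical, and trying UTF-8 first and then GBK is only a heuristic, not how Zotero actually works.

```python
import subprocess


def build_pdfinfo_command(pdf_path, encoding="GBK", binary="pdfinfo"):
    """Return the argv list for pdfinfo with an explicit output encoding."""
    # Assumption: the installed pdfinfo supports -enc and knows this encoding.
    return [binary, "-enc", encoding, pdf_path]


def decode_output(raw, encodings=("utf-8", "gbk")):
    """Try each encoding in order; replace undecodable bytes as a last resort."""
    for name in encodings:
        try:
            return raw.decode(name)
        except UnicodeDecodeError:
            continue
    return raw.decode(encodings[-1], errors="replace")


def parse_pdfinfo(text):
    """Turn 'Key:   value' lines into a dict, splitting on the first colon."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def read_metadata(pdf_path, encoding="GBK"):
    """Run pdfinfo and return the parsed metadata fields."""
    result = subprocess.run(
        build_pdfinfo_command(pdf_path, encoding),
        capture_output=True,
        check=True,
    )
    # The output is decoded with the same encoding that was requested.
    return parse_pdfinfo(result.stdout.decode(encoding, errors="replace"))
```

Calling read_metadata("paper.pdf") would then return a dict of the fields that pdfinfo prints, while decode_output could be used on its own when the encoding of the output is not known in advance.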
specter119 (edited September 10, 2017): Should I submit a GBK encoding request on GitHub for the developers?