How to read pdf metadate with GBK encoding?
I find it hard for pdf written in Chinese to be encoded with default utf-8 method. Therefore, reading metadata will fail for lack of information.
Would you please pride a way to custom the pdftotext command, let us manually support it with GBK encoding? Once we build the correct parent item, we can think about how to support both utf-8 and GBK encoding.
By the way, I once glimpsed that the version of pdftotext and pdfinfo update to 4.0beta, but back to 3.0.2 later again. I think they have been released to 4.0 for a long time.
Would you please pride a way to custom the pdftotext command, let us manually support it with GBK encoding? Once we build the correct parent item, we can think about how to support both utf-8 and GBK encoding.
By the way, I once glimpsed that the version of pdftotext and pdfinfo update to 4.0beta, but back to 3.0.2 later again. I think they have been released to 4.0 for a long time.
-
specter119edited September 10, 2017Should I submit GBK encoding request to github for developers?