How to read PDF metadata with GBK encoding?

edited September 10, 2017
I find that PDFs written in Chinese are often not handled correctly with the default UTF-8 encoding. As a result, retrieving metadata fails because not enough information can be extracted.

Would you please provide a way to customize the pdftotext command, so that we can manually make it work with GBK encoding? Once we can build the correct parent item, we can think about how to support both UTF-8 and GBK encoding.
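
To make the request concrete, here is a rough Python sketch of the kind of fallback I have in mind. It is not how Zotero does it, only an illustration. It assumes that pdfinfo is on the PATH and that its output for these files arrives as GBK bytes when it is not valid UTF-8. The names decode_metadata, parse_info and read_pdf_metadata are made up for this example.

```python
import subprocess


def decode_metadata(raw: bytes) -> str:
    """Decode tool output as UTF-8, falling back to GBK."""
    try:
        # Strict UTF-8 rejects most GBK byte sequences, so try it first.
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Assumption: anything that is not valid UTF-8 is GBK.
        return raw.decode("gbk", errors="replace")


def parse_info(text: str) -> dict:
    """Split 'Key: value' lines, as printed by pdfinfo, into a dict."""
    fields = {}
    for line in text.splitlines():
        # Split on the first colon only, so values containing colons survive.
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def read_pdf_metadata(path: str) -> dict:
    """Hypothetical helper: run pdfinfo (assumed to be on PATH) and parse it."""
    result = subprocess.run(["pdfinfo", path], capture_output=True, check=True)
    return parse_info(decode_metadata(result.stdout))
```

UTF-8 is tried first because it fails loudly on invalid bytes, while GBK accepts many byte sequences and could silently produce wrong characters. Both tools also document an -enc option for the output encoding, but whether a name like GBK is accepted seems to depend on the build and the installed language support, so the sketch does not rely on it.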

By the way, I once noticed that the version of pdftotext and pdfinfo had been updated to 4.0beta, but it later went back to 3.0.2. I think 4.0 has been released for a long time now.
