How to index big PDF file? Error message ExtGState should be a dictionary

xing-han · November 4, 2025

I have some pdf files . It goes wrong when getting fulltext content like this.

https://s3.amazonaws.com/zotero.org/images/forums/u18719689/l732tbjcqhvt77hyzlfk.png

Please help guide how to solve it，thank you .

Submitted with Debug ID D447104597
see Debug Out below：

(3)(+0000005): Indexing item 1/YSITTMH9

(3)(+0000006): Getting fulltext content from item 1/YSITTMH9

(1)(+0000044): Error: Worker 'getFullText' failed: {"error":"{\"message\":\"ExtGState should be a dictionary.\",\"name\":\"FormatError\"}"} getFullText/<@chrome://zotero/content/xpcom/pdfWorker/manager.js:650:21

(1)(+0000001): Error: Worker 'getFullText' failed: {"error":"{\"message\":\"ExtGState should be a dictionary.\",\"name\":\"FormatError\"}"} getFullText/<@chrome://zotero/content/xpcom/pdfWorker/manager.js:650:21

(3)(+0000001): ProgressQueue: updating row 8, 2, 正在处理

(3)(+0000001): RecognizeDocument: Recognizing attachment GY1《观影说多维实相》之影评荟萃-第一册（1-15） 20250312(证书签名)

(3)(+0000002): Getting PDF recognizer data from item 1/YSITTMH9

(1)(+0000189): Error: Worker 'getRecognizerData' failed: {"error":"{\"message\":\"ExtGState should be a dictionary.\",\"name\":\"FormatError\"}"} getRecognizerData/<@chrome://zotero/content/xpcom/pdfWorker/manager.js:694:21

(1)(+0000001): Error: Worker 'getRecognizerData' failed: {"error":"{\"message\":\"ExtGState should be a dictionary.\",\"name\":\"FormatError\"}"} getRecognizerData/<@chrome://zotero/content/xpcom/pdfWorker/manager.js:694:21

(1)(+0000001): { "name": "recognizePDF.couldNotRead" "params": [] "_title": "general.error" "cause": undefined "title": "错误" "message": "无法从文件中读取文本" "toString": function () {...} "present": function (window) {...} "log": function () {...} }

(3)(+0000000): ProgressQueue: updating row 8, 3, 无法从文件中读取文本

(3)(+0000005): itemTree.render(). Displaying Item Tree

(3)(+0000010): BlockingObserver: Added observer

(3)(+0000221): Got MIME type application/pdf from extension 'pdf'

(3)(+0000018): itemTree.render(). Displaying Item Tree

(3)(+0000127): Refreshing attachment box

(4)(+0000004): SELECT indexedPages, totalPages AS total FROM fulltextItems WHERE itemID=? [8]

(4)(+0000003): SELECT synced FROM fulltextItems WHERE itemID=? [8]

(4)(+0000002): SELECT synced FROM fulltextItems WHERE itemID=? [8]

martynas_b · November 6, 2025

Could you send the PDF file to support@zotero.org with a link to this thread?

xing-han · November 7, 2025

@martynas_b ‌The mail has been sent out yesterdoy with the download links for these two PDF files. Please download and test them.thank you

GY1《观影说多维实相》之影评荟萃-第一册（1-15）-20250312
https://xiyushe.org/download/42/pdf/42.pdf

Y6-4《已知的实相VI》第4册（271-280）-内在自我的多维结构与运作方式(共四册）
https://xiyushe.org/download/181/pdf/181.pdf

martynas_b · November 7, 2025

The issue will be fixed in the next Zotero Beta update.

dstillman · November 11, 2025

Fixed in the latest Zotero beta

xing-han · November 12, 2025

@dstillman
it works，thank you very much
https://s3.amazonaws.com/zotero.org/images/forums/u18719689/09zohpfzrgrm4bdhhyg0.png