How to index a big PDF file? Error message: "ExtGState should be a dictionary"
I have some PDF files. Getting the full-text content fails for them, as shown here:
https://s3.amazonaws.com/zotero.org/images/forums/u18719689/l732tbjcqhvt77hyzlfk.png
Please advise how to solve this. Thank you.
Submitted with Debug ID D447104597
See the debug output below:
(3)(+0000005): Indexing item 1/YSITTMH9
(3)(+0000006): Getting fulltext content from item 1/YSITTMH9
(1)(+0000044): Error: Worker 'getFullText' failed: {"error":"{\"message\":\"ExtGState should be a dictionary.\",\"name\":\"FormatError\"}"} getFullText/<@chrome://zotero/content/xpcom/pdfWorker/manager.js:650:21
(1)(+0000001): Error: Worker 'getFullText' failed: {"error":"{\"message\":\"ExtGState should be a dictionary.\",\"name\":\"FormatError\"}"} getFullText/<@chrome://zotero/content/xpcom/pdfWorker/manager.js:650:21
(3)(+0000001): ProgressQueue: updating row 8, 2, 正在处理
(3)(+0000001): RecognizeDocument: Recognizing attachment GY1《观影说多维实相》之影评荟萃-第一册(1-15) 20250312(证书签名)
(3)(+0000002): Getting PDF recognizer data from item 1/YSITTMH9
(1)(+0000189): Error: Worker 'getRecognizerData' failed: {"error":"{\"message\":\"ExtGState should be a dictionary.\",\"name\":\"FormatError\"}"} getRecognizerData/<@chrome://zotero/content/xpcom/pdfWorker/manager.js:694:21
(1)(+0000001): Error: Worker 'getRecognizerData' failed: {"error":"{\"message\":\"ExtGState should be a dictionary.\",\"name\":\"FormatError\"}"} getRecognizerData/<@chrome://zotero/content/xpcom/pdfWorker/manager.js:694:21
(1)(+0000001): { "name": "recognizePDF.couldNotRead" "params": [] "_title": "general.error" "cause": undefined "title": "错误" "message": "无法从文件中读取文本" "toString": function () {...} "present": function (window) {...} "log": function () {...} }
(3)(+0000000): ProgressQueue: updating row 8, 3, 无法从文件中读取文本
(3)(+0000005): itemTree.render(). Displaying Item Tree
(3)(+0000010): BlockingObserver: Added observer
(3)(+0000221): Got MIME type application/pdf from extension 'pdf'
(3)(+0000018): itemTree.render(). Displaying Item Tree
(3)(+0000127): Refreshing attachment box
(4)(+0000004): SELECT indexedPages, totalPages AS total FROM fulltextItems WHERE itemID=? [8]
(4)(+0000003): SELECT synced FROM fulltextItems WHERE itemID=? [8]
(4)(+0000002): SELECT synced FROM fulltextItems WHERE itemID=? [8]
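For anyone reading a similar log: the lines that matter above are the `Worker '...' failed` entries, each carrying a JSON payload whose `error` field is itself a JSON string. Below is a minimal Python sketch for pulling those out of a pasted debug output. It is not part of Zotero; `parse_worker_errors` and `SAMPLE_LOG` are hypothetical names, and the line format is assumed only from the entries shown in this post.

```python
import json
import re

# Matches the prefix of a worker-failure line as it appears in this post's log,
# e.g. "(1)(+0000044): Error: Worker 'getFullText' failed: "
WORKER_ERROR = re.compile(r"^\((\d+)\)\(\+(\d+)\): Error: Worker '(\w+)' failed: ")

# Two error lines and one ordinary line copied from the debug output above.
SAMPLE_LOG = r'''(3)(+0000006): Getting fulltext content from item 1/YSITTMH9
(1)(+0000044): Error: Worker 'getFullText' failed: {"error":"{\"message\":\"ExtGState should be a dictionary.\",\"name\":\"FormatError\"}"} getFullText/<@chrome://zotero/content/xpcom/pdfWorker/manager.js:650:21
(1)(+0000189): Error: Worker 'getRecognizerData' failed: {"error":"{\"message\":\"ExtGState should be a dictionary.\",\"name\":\"FormatError\"}"} getRecognizerData/<@chrome://zotero/content/xpcom/pdfWorker/manager.js:694:21'''


def parse_worker_errors(log_text):
    """Return (worker, error_name, message) tuples, de-duplicated, in log order."""
    decoder = json.JSONDecoder()
    found = []
    for line in log_text.splitlines():
        match = WORKER_ERROR.match(line)
        if not match:
            continue
        # Decode the JSON object that starts right after "failed: " and
        # ignore the stack-trace text that follows it on the same line.
        outer, _ = decoder.raw_decode(line, match.end())
        # The "error" value is a JSON document encoded as a string.
        inner = json.loads(outer["error"])
        entry = (match.group(3), inner.get("name"), inner.get("message"))
        if entry not in found:
            found.append(entry)
    return found


if __name__ == "__main__":
    for worker, name, message in parse_worker_errors(SAMPLE_LOG):
        print(f"{worker}: {name}: {message}")
```

Both the full-text worker and the recognizer worker report the same `FormatError`, which suggests the failure comes from parsing the PDF itself rather than from the file's size.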
GY1《观影说多维实相》之影评荟萃-第一册(1-15)-20250312
https://xiyushe.org/download/42/pdf/42.pdf
Y6-4《已知的实相VI》第4册(271-280)-内在自我的多维结构与运作方式(共四册)
https://xiyushe.org/download/181/pdf/181.pdf
It works, thank you very much.
https://s3.amazonaws.com/zotero.org/images/forums/u18719689/09zohpfzrgrm4bdhhyg0.png