Full text indexing of djvu files
I like very much that Zotero indexes full texts of all attached PDFs. However, I have a significant part of my collection in djvu format (it offers better compression of scanned images than PDF, which is especially important for books). All these files also have OCR text inside, which potentially can be indexed.
So is it planned to add support for indexing of this format in the future? I guess, it can be done completely analogously to PDFs, but using some other utility to get text out of it instead of pdf2text (though I can not directly point to any such utility right now).
So is it planned to add support for indexing of this format in the future? I guess, it can be done completely analogously to PDFs, but using some other utility to get text out of it instead of pdf2text (though I can not directly point to any such utility right now).
Ticket created, though.
So assuming IPC won't happen anytime soon, we could probably bundle two generic runner scripts—a VBS for Windows and a shell script for OS X/Linux—and just use those to launch programs (with the path passed as a parameter) and redirect the output to a file.