Importing PDF with "Create Web Page Item from Current Page" not possible anymore?
Hi,
In the past I've used the "Create Web Page Item from Current Page" button to import PDFs. This worked very well. I then renamed them in Zotero (and saved them externally afterwards).
This does not seem to work any more. At least, I get an error message when I try to open these "not properly imported PDFs".
The link below is to the two files that I tried to import as described above.
http://www.tandfonline.com/doi/pdf/10.1080/03085149000000001
https://www.jstor.org/stable/pdf/591464.pdf?acceptTC=true
The error message that I get from the PDF Xchange Viewer is that the file is apparently in HTML instead of PDF.
Have any recent changes been made to the "Create Web Page Item from Current Page" function so that importing PDF files does not word anymore that way? Could that be reverted?
In the past I've used the "Create Web Page Item from Current Page" button to import PDFs. This worked very well. I then renamed them in Zotero (and saved them externally afterwards).
This does not seem to work any more. At least, I get an error message when I try to open these "not properly imported PDFs".
The link below is to the two files that I tried to import as described above.
http://www.tandfonline.com/doi/pdf/10.1080/03085149000000001
https://www.jstor.org/stable/pdf/591464.pdf?acceptTC=true
The error message that I get from the PDF Xchange Viewer is that the file is apparently in HTML instead of PDF.
Have any recent changes been made to the "Create Web Page Item from Current Page" function so that importing PDF files does not word anymore that way? Could that be reverted?
<</DescendantFonts[9 0 R]/BaseFont/LPZIWC+Code2000/Type/Font/Encoding/Identity-H/Subtype/Type0/ToUnicode 12 0 R>>
instead of
<</DescendantFonts[9 0 R]/BaseFont/OVNCVG+Code2000/Type/Font/Encoding/Identity-H/Subtype/Type0/ToUnicode 12 0 R>>
I am not aware of any recent changes in the save as webpage functionality.
The problem on my side persists with PDF imports from at least the following large publishers/repositories: Taylor & Francis and JStor.
Latest item I have trouble with:
http://www.tandfonline.com/doi/pdf/10.1080/13603124.2014.958199
The size of these apparently "HTML in disguide of PDF files" is about 4 KB which I take as a strong indicator that something went wrong during the import process.
I was able to successfully import other PDFs from the web though.
Using Adobe Acrobat does not make a difference on my side. I get a similar error message.
Do you have any ideas of what might be causing the problem here? I wish you could replicate the problem to assist me with this.
I hope this helps in investigating the problem.
Many thanks!
@Dan: Anything in that error report? Do you have any ideas what might be broken in zurpher's installation?
=> I had to whitelist the cookies from the URLs in Firefox.
http://www.tandfonline.com/
http://www.jstor.org/
This is insofar strange as I allowed first-party cookies already. I only do NOT allow third-party cookies. I wonder what whitelisting cookies in Firefox actually does. Perhaps this also automatically whitelists third-party (tracking) cookies that T&F and JSTOR now enforce. I still do not understand why this created a problem with the Zotero import functionality in the first place. But hey, problem solved.
This was the error message from PDF Xchange Viewer This is what I find in the PDF/HTML file when I manually change it to HTML and then open it in Firefox: Same issue with JSTOR: