PDF with incorrect metadata
I am on the ECHR case database site. I search for and find the case I want, and I am given the option to download a PDF. When I click their PDF button, the correct case downloads in PDF format, but the metadata then retrieved for it is completely wrong. I've tried it several times with different cases and get the same result. Help please.
http://hudoc.echr.coe.int/sites/eng/Pages/search.aspx#{%22fulltext%22:[%22A%22,%22B%20and%20C%20v%20Ireland%22],%22documentcollectionid%22:[%22COMMITTEE%22,%22DECISIONS%22,%22COMMUNICATEDCASES%22,%22CLIN%22,%22ADVISORYOPINIONS%22,%22REPORTS%22,%22RESOLUTIONS%22]}
When I click on their PDF button I get the PDF of the case I want, but when the metadata is retrieved it is for an entirely different case. I downloaded A, B and C v Ireland and got metadata for Sejdic & Finci v Bosnia ....
In your case the PDF does not contain a DOI and is not indexed by Google Scholar, so it should not return any metadata. However, because of a long string of very short lines in the PDF, Zotero was producing some very weak queries to Google Scholar and picking up that false positive article you are referring to. This will be fixed.
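To illustrate the kind of fix described here, below is a minimal sketch in Python. It is not Zotero's actual code (Zotero itself is written in JavaScript); the function name, the word threshold, and the sample text are all made up for illustration. The idea is simply to skip very short lines so that they never become search phrases.

```python
def pick_query_lines(text, min_words=6, max_queries=3):
    """Return quoted phrase queries built only from reasonably long lines.

    min_words and max_queries are illustrative guesses, not Zotero's values.
    """
    candidates = []
    for raw in text.splitlines():
        # Collapse runs of whitespace so word counts are reliable.
        line = " ".join(raw.split())
        if len(line.split()) >= min_words:
            candidates.append(line)
    # Wrap each phrase in double quotes for an exact-phrase search.
    return ['"%s"' % line for line in candidates[:max_queries]]


# Invented sample: a run of very short lines followed by one ordinary sentence.
sample = """CASE OF
A, B
AND C
v.
IRELAND
This sentence is long enough to work as a distinctive search phrase
"""

queries = pick_query_lines(sample)
print(queries)
```

With a filter like this, a document made up only of short lines produces no queries at all, which is better than sending weak ones that match an unrelated article.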
Additionally, even with the above issue fixed, you would still get a false positive, because there is a review of that case that contains a couple of verbatim quotes from it. We'll try to fix that false positive as well.
Unfortunately for you, even with the above fixes, you will not be able to retrieve metadata directly from the PDF, but at least you will not have incorrect metadata.
Depending on how popular the HUDOC website is (I have no clue), it might be worth writing a translator for it. The only problem is that it seems to use a lot of AJAX, which complicates this quite a bit.
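One reason a page like this is awkward to scrape: in the search URL posted above, the search itself is a URL-encoded JSON object sitting after the "#". That part of a URL is never sent to the server, so the page has to read it in the browser and fetch the results itself. The sketch below, in Python rather than the JavaScript that Zotero translators use, only shows how that fragment can be decoded. The URL is a shortened copy of the one in this thread, and nothing here is an actual translator.

```python
import json
from urllib.parse import unquote, urlsplit

# Shortened copy of the search URL from this thread (fewer collection ids).
url = (
    "http://hudoc.echr.coe.int/sites/eng/Pages/search.aspx"
    "#{%22fulltext%22:[%22A%22,%22B%20and%20C%20v%20Ireland%22],"
    "%22documentcollectionid%22:[%22COMMITTEE%22,%22DECISIONS%22]}"
)

# Everything after '#' is the fragment; it stays in the browser.
fragment = urlsplit(url).fragment

# Percent-decoding turns %22 into '"' and %20 into ' ', leaving plain JSON.
state = json.loads(unquote(fragment))

print(state["fulltext"])
print(state["documentcollectionid"])
```

A translator would still have to work out which requests the page makes with these values, which is the part that AJAX makes hard.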