Remove proxy information from URL field on import

For many publisher websites (e.g., Springer, Sage, Wiley, APA), when I save an item through the primary translator, the URL recorded includes the proxy information for my university library. This appears to be because, inanely, both the metadata embedded in the page and the URL displayed on the article page include the proxy information rather than the true persistent URL. Would it be possible for the translators to check for this sort of proxy information and strip it out?

Some example pages:
http://onlinelibrary.wiley.com/doi/10.1002/j.2333-8504.1998.tb01781.x/abstract
(compared to http://onlinelibrary.wiley.com.ezp1.lib.umn.edu/doi/10.1002/j.2333-8504.1998.tb01781.x/abstract)

http://link.springer.com/chapter/10.1007/978-1-4757-2691-6_23

http://apm.sagepub.com/content/30/3/216
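
To make the request concrete, here is a rough sketch of the kind of check I have in mind. It is written in Python purely for illustration (it is not translator code), and it assumes the proxy host suffix (here ezp1.lib.umn.edu, taken from the Wiley example above) is already known. The function name strip_proxy and the proxy_suffixes parameter are hypothetical. It only handles proxies that append their host to the publisher's host, and it drops any user:password part of the URL.

```python
from urllib.parse import urlsplit, urlunsplit


def strip_proxy(url, proxy_suffixes):
    """Return url with a known proxy suffix removed from its host.

    proxy_suffixes is an assumed list of proxy host suffixes,
    e.g. ["ezp1.lib.umn.edu"]. URLs that do not match are returned unchanged.
    """
    parts = urlsplit(url)
    host = parts.hostname or ""  # hostname is already lowercased
    for suffix in proxy_suffixes:
        tail = "." + suffix.lower()
        # Require something to remain in front of the proxy suffix.
        if host.endswith(tail) and len(host) > len(tail):
            netloc = host[: -len(tail)]
            if parts.port:
                netloc += f":{parts.port}"
            return urlunsplit(
                (parts.scheme, netloc, parts.path, parts.query, parts.fragment)
            )
    return url


proxied = (
    "http://onlinelibrary.wiley.com.ezp1.lib.umn.edu"
    "/doi/10.1002/j.2333-8504.1998.tb01781.x/abstract"
)
print(strip_proxy(proxied, ["ezp1.lib.umn.edu"]))
```

A URL that is not proxied, such as the Springer link above, comes back unchanged.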
  • Yes, that should be possible. I actually already have a patch for this that I'll have to dig up.
  • Found it! (only took me a year to issue a pull request...)
  • nice! that should have happened ages ago.
  • Sweet! Thanks.
  • Could the proxy also be removed from the Library Catalog field when the URL is stored there (e.g., www.pnas.org)?
  • We could fix it (in the translator), but I'm not really sure we should be storing the host there anyway. adamsmith, can we come up with a standardized format for Library Catalog field for translators? We use it for troubleshooting (i.e. adding a translator name), but that's probably not compatible with citations.

    Only 3 styles use that variable, but one of them is MLA. In MLA, we only use it in the absence of a DOI and URL, which is rare, but my understanding is that the Library Catalog field in those cases should still refer to the place where the full-text resource can be obtained (so if we're missing the URL, we should be missing the library catalog too).