Using Zotero Translator Framework in an external web crawling app?
Hi,
I'm new to this forum so sorry if this is the wrong place to post.
I'm working on a web crawling project related to scientific knowledge infrastructure, and would like the ability to extract the metadata from any URL using the Zotero translator framework (https://github.com/zotero/zotero-connectors?tab=readme-ov-file#zotero-translator-framework)
To be concrete, I would like to create a Python web crawling library with a function F such that F(url) would return a JSON object of the extracted metadata. This metadata would be the same as if I had saved the item to my Zotero library using the Zotero Connector extension. (https://github.com/zotero/zotero-connectors)
Maybe someone is aware of existing projects doing this or something similar? I could always dive into the codebase but since I'm new to this field I thought I'd check first :)
Thanks,
Ronen
I'm new to this forum so sorry if this is the wrong place to post.
I'm working on a web crawling project related to scientific knowledge infrastructure, and would like the ability to extract the metadata from any URL using the Zotero translator framework (https://github.com/zotero/zotero-connectors?tab=readme-ov-file#zotero-translator-framework)
To be concrete, I would like to create a Python web crawling library with a function F such that F(url) would return a JSON object of the extracted metadata. This metadata would be the same as if I had saved the item to my Zotero library using the Zotero Connector extension. (https://github.com/zotero/zotero-connectors)
Maybe someone is aware of existing projects doing this or something similar? I could always dive into the codebase but since I'm new to this field I thought I'd check first :)
Thanks,
Ronen
-
zuphilipHave a look at the translation-server repository https://github.com/zotero/translation-server which is a NodeJS approach doing this. (Wikipedia is using a similar approach called Citoid.)
-
ronentThanks, this is very helpful!