[5.0 Beta] Patent RSS Feeds

edited February 10, 2017
Running some patent RSS feeds from Espacenet, and the 'creator' (inventor) field seems to be bugged, i.e. most items show something like "[US and [IN" where the RSS feed provides <dc:creator>KURIAN MANU JACOB [US]; PURUSHOTHAMAN SASIDHAR [IN]</dc:creator> (See example Feed URL and snippet from the RSS source below). Other items either show creator as blank, but some have the correct info.

Other requests: I made a similar request for zotero 4.x but it would be extremely helpful to show patent publication number/identifier and patent assignee/applicant in the main list (in Feeds and in the Library). Also, sort functionality (I think this has been mentioned elsewhere for the Beta). Thanks

Example Feed URL: https://worldwide.espacenet.com/websyndication/searchFeed?DB=EPODOC&amp;ST=singleline&amp;Submit=Search&amp;locale=en_EP&amp;query=cpc=G06K19/0723+AND+pd>=20161201)

(edit: not sure how to post code here, the tags were all removed, see the source from the RSS URL above)
RSS snippet:

<item>
<title>Resource tag generation and deployment for resource valuation and distribution</title>
<link>https://worldwide.espacenet.com/publicationDetails/biblio?CC=US&amp;NR=9563873B1&amp;KC=B1&amp;FT=D&amp;DB=EPODOC&amp;locale=en_EP&amp;date=20170207&amp;rss=true</link>
<pubDate>Tue, 07 Feb 2017 12:00:00 GMT</pubDate>
<guid isPermaLink="false">https://worldwide.espacenet.com/publicationDetails/biblio?CC=US&amp;NR=9563873B1&amp;KC=B1&amp;FT=D&amp;DB=EPODOC&amp;locale=en_EP&amp;date=20170207&amp;rss=true</guid>
<dc:creator>KURIAN MANU JACOB [US]; PURUSHOTHAMAN SASIDHAR [IN]</dc:creator>
<dc:publisher>Espacenet</dc:publisher>
<dc:date>2017-02-07T12:00:00Z</dc:date>
<dc:identifier>US9563873B1 (B1)</dc:identifier>
<esp:applicants>BANK OF AMERICA [US]</esp:applicants>
<esp:cpcClassifications>G06Q10/087; G06K7/10366 (2)</esp:cpcClassifications>
<esp:ipcClassifications>G06F17/00; G06K17/00 (1)</esp:ipcClassifications>
<esp:publicationCycle>US9563873B1 - 2017-02-07</esp:publicationCycle>
<esp:priorityDate>2016-03-28</esp:priorityDate>
</item>
  • Regarding author parsing I've submitted a pull request so there should be an improvement on that front. Although the feed does present authors in a weird format and perhaps a good way to address this would be talking to espacenet. They should at least not provide authors in all-caps. Do note, that if you drag the item into your library or use the Add to My Library/Collection button in the item pane Zotero fetches correct metadata.

    Regarding your second request - we'd generally want more customisability for the item list, but that comes at the cost of increased complexity (both in terms of code and UI) and this is not an often requested feature, so no promises on that one any time soon. We do welcome code contributions from the community if you want to give it a shot.
  • adomasven, thanks for the reply. I think espacenet's policy on formatting is to transparently pass whatever they receive from the different international publishing agencies through to their users unedited, but it could certainly be a worthwhile conversation as Zotero has the potential to be massively more useful as a patent citation manager than the next-most-useful solution, IMO.

    I guess I underestimated the difficulty in adding fields to the column/sort UI list, since the 'patent number' and 'assignee' fields are already parsed into the database for each cited patent. My coding skills are pretty weak, and not sure I'd know where to start, but maybe I could maybe take a stab at it...
  • I guess I underestimated the difficulty in adding fields to the column/sort UI list, since the 'patent number' and 'assignee' fields are already parsed into the database for each cited patent.
    This has less to do with the difficulty in adding these fields to the columns elector and more with the fact that there are too many fields to display them all. I am not really sure how the current list came about, but I assume there is some logic to prioritising the fields that are in it. Allowing a greater flexibility would mean adding a more rigorous and thorough solution to show/hide all possible columns for all item types, perhaps somewhere in the preferences pane. On the other hand, exposing too much customisability to the end-users makes the whole software less user-friendly. As introducing this requires significant consideration, yet there are not many requests, this isn't currently a high priority, unfortunately.
    My coding skills are pretty weak, and not sure I'd know where to start, but maybe I could maybe take a stab at it...
    FWIW, it is fairly easy to write a Zotero plugin that reads a list of fields from a config file and adds them to the column selector.
  • I'd like to attempt a plugin. Assuming I want it to work in Zotero 5, should I start with looking at the Javascript API? The documentation is a little confusing, but I'm assuming anything to do with Firefox/Connector/XUL/SQLite won't help me.
  • I am not really sure how the current list came about, but I assume there is some logic to prioritising the fields that are in it.
    I think that's attributing more system to the process than justified. IIRC, the current list was proposed by @Gracile with Dan, me, and some others chiming in. I don't think it should be treated as eternal truth. I think broad applicability was a criterion and in that vein, I don't think Assignee (which just exists for Patents) makes sense, but I think Number (which exists for a whole range of item types) does. I think I mentioned somewhere else that I'd be very much in favor of adding that to the "More Columns" list.
  • With no guarantees that all of this is up to date, plugin development is described here:
    https://www.zotero.org/support/dev/sample_plugin

    Zotero 5.0 sill uses the XULrunner framework, so we're still talking basically Firefox add-ons (to be installed in Standalone).
  • edited February 15, 2017
    My vote would still be for a dedicated Assignee column, as people who rarely cite patents would probably never use it anyway, but it would be really useful for patent people.

    But if not, maybe append Assignee to the Creator (Inventor) field. - i.e. "Bartman et al. (Microsoft)" if a cited item has a non-empty assignee field... You still couldn't sort by Assignee, but it would be better than nothing.

    In any case I'll take a deeper look at the plugin dev area.
  • I think that's attributing more system to the process than justified. IIRC, the current list was proposed by @Gracile with Dan, me, and some others chiming in. I don't think it should be treated as eternal truth.
    Discussion here: https://github.com/zotero/zotero/issues/84
  • Great, thanks -- that's pretty much how I remembered it and as you can tell not super systematic. I also think the cost of adding columns to "More Columns" is relatively low as long as we don't go crazy.
  • It might even be convenient to further submenu item-specific fields together.
Sign In or Register to comment.