Good way to save Comments from website articles?
Anyone have any good advise on how to save all the comments from an article on a news website or blog?
There are several problems with doing this:
(1) Since many of the sites have moved to loading the comments with XMLHttpRequest calls, when you add the page itself to your Zotero library, NONE of the comments are saved with it.
(2) I have never found a site that has a link that allows you to view all the comments on one page, which would at least allow you to attach that page separately.
So what options are we left with? The only way I've found is to copy and paste the comments into an attached Note to my Zotero entry for that article. It is, however, very time consuming if you want to get all of the comments, especially if there are hundreds of them and the damn website will only allow you to view 25 or 50 at a time.
Anyone else have other techniques you've used to save comments?
Cheers
--Dan
There are several problems with doing this:
(1) Since many of the sites have moved to loading the comments with XMLHttpRequest calls, when you add the page itself to your Zotero library, NONE of the comments are saved with it.
(2) I have never found a site that has a link that allows you to view all the comments on one page, which would at least allow you to attach that page separately.
So what options are we left with? The only way I've found is to copy and paste the comments into an attached Note to my Zotero entry for that article. It is, however, very time consuming if you want to get all of the comments, especially if there are hundreds of them and the damn website will only allow you to view 25 or 50 at a time.
Anyone else have other techniques you've used to save comments?
Cheers
--Dan
Here are some examples:
The NPR.org site uses the DISQUS comments service, which loads with Javascript after the article page is loaded:
http://www.npr.org/blogs/thetwo-way/2013/01/18/169646736/as-social-issues-drive-young-from-church-leaders-try-to-keep-them
Go to that page and save it into Zotero, then view the snapshot it took. You'll see the page, but the area where comments are loaded is blank. I'm noticing more and more news sites using DISQUS. Here's another example, though with this one you can't tell it's using DISQUS until you go to login:
http://www.kentucky.com/2012/11/30/2426599/asbury-professor-writes-of-democrats.html
Many other news sites are starting to use the Facebook comments API, which works in a similar way. They may load the first 50 or so comments onto the initial HTML that's sent. But if there are more and you keep clicking the "View previous comments" link to eventually show all of them on the page, then save the page to Zotero, it doesn't save all the comments in the snapshot, only the ones that initially showed on the page. So in that case, the only way to save all of them is to show them all, the copy and paste them into a note. But then you run into the issue that if there are a lot (say a thousand), Zotero won't allow you to save that much into a note.
The NYTimes.com site, however, looks like it works the same way, yet Zotero seems to be able to save the comments that are shown after clicking the "READ MORE COMMENTS" link at the bottom of the comments. Here's an example:
http://takingnote.blogs.nytimes.com/2013/01/17/no-comment-necessary-conspiracy-nation/?ref=opinion
--Dan
I'm looking into how we can adapt the code from that extension for Zotero Snapshots actually.
Credit goes to Rintze for discovering this btw.
Now if the Mozilla Archive Format extension could save only selected parts of a page (like Evernote Web Clipper) it would be perfect :-)
Thanks
http://forums.zotero.org/discussion/27739?page=1#Item_1
On the Wikipedia page about MAFF[1] i found a chrome extension "which uses the data URI scheme to package everything in a single .html file"[2]
[1]: http://en.wikipedia.org/wiki/Mozilla_Archive_Format
[2]: https://chrome.google.com/webstore/detail/mpiodijhokgodhhofbcjdecpffjipkle
aurimas, did you ever make any progress in figuring out how Evernote's Clipper captures pages, or perhaps how the Mozilla Archive Format captures the entire page, comments and all?
Thanks
--Dan