gist.github.com/bartolsthoorn/4b1b71b133c8282a0ff5

Scraper using Simhash with DOM trees. This is a proof of concept that scrapes Hackernews with just 3 lines (34-36) of configuration. - Gist is a simple way to share snippets of text and code with others.


Comments (0)

Sign in to post comments.