Suggested StormCrawler for web crawler section

Matt Burns
2017-08-01 16:28:31 +01:00
committed by GitHub
parent 248d545e0c
commit 8607b81359

@@ -848,6 +848,7 @@ A curated list of awesome Java frameworks, libraries and software.
* [Apache Nutch](http://nutch.apache.org/) - Highly extensible, highly scalable web crawler for production environments.
* [Crawler4j](https://github.com/yasserg/crawler4j) - Simple and lightweight web crawler.
* [JSoup](http://jsoup.org/) - Scrapes, parses, manipulates and cleans HTML.
* [StormCrawler](http://stormcrawler.net/) - SDK for building low-latency, scalable web crawlers on [Apache Storm](http://storm.apache.org/).
## Web Frameworks