Suggested StormCrawler for web crawler section
This commit is contained in:
@ -848,6 +848,7 @@ A curated list of awesome Java frameworks, libraries and software.
|
|||||||
* [Apache Nutch](http://nutch.apache.org/) - Highly extensible, highly scalable web crawler for production environments.
|
* [Apache Nutch](http://nutch.apache.org/) - Highly extensible, highly scalable web crawler for production environments.
|
||||||
* [Crawler4j](https://github.com/yasserg/crawler4j) - Simple and lightweight web crawler.
|
* [Crawler4j](https://github.com/yasserg/crawler4j) - Simple and lightweight web crawler.
|
||||||
* [JSoup](http://jsoup.org/) - Scrapes, parses, manipulates and cleans HTML.
|
* [JSoup](http://jsoup.org/) - Scrapes, parses, manipulates and cleans HTML.
|
||||||
|
* [StormCrawler](http://stormcrawler.net/) - SDK for building low-latency, scalable web crawlers on [Apache Storm](http://storm.apache.org/).
|
||||||
|
|
||||||
## Web Frameworks
|
## Web Frameworks
|
||||||
|
|
||||||
|
Reference in New Issue
Block a user