From 8607b81359e8266508973b8b080c8dc0eff8e92d Mon Sep 17 00:00:00 2001
From: Matt Burns
Date: Tue, 1 Aug 2017 16:28:31 +0100
Subject: [PATCH 1/2] Suggested StormCrawler for web crawler section

---
 README.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/README.md b/README.md
index c8cde72..c7d93f7 100644
--- a/README.md
+++ b/README.md
@@ -848,6 +848,7 @@ A curated list of awesome Java frameworks, libraries and software.
 * [Apache Nutch](http://nutch.apache.org/) - Highly extensible, highly scalable web crawler for production environments.
 * [Crawler4j](https://github.com/yasserg/crawler4j) - Simple and lightweight web crawler.
 * [JSoup](http://jsoup.org/) - Scrapes, parses, manipulates and cleans HTML.
+* [StormCrawler](http://stormcrawler.net/) - SDK for building low-latency, scalable web crawlers on [Apache Storm](http://storm.apache.org/).

 ## Web Frameworks

From 92041873d324e1bb6f015f064a7b699658cd3b28 Mon Sep 17 00:00:00 2001
From: Andreas Kull
Date: Tue, 24 Oct 2017 12:13:55 +0200
Subject: [PATCH 2/2] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index c7d93f7..f61632a 100644
--- a/README.md
+++ b/README.md
@@ -848,7 +848,7 @@ A curated list of awesome Java frameworks, libraries and software.
 * [Apache Nutch](http://nutch.apache.org/) - Highly extensible, highly scalable web crawler for production environments.
 * [Crawler4j](https://github.com/yasserg/crawler4j) - Simple and lightweight web crawler.
 * [JSoup](http://jsoup.org/) - Scrapes, parses, manipulates and cleans HTML.
-* [StormCrawler](http://stormcrawler.net/) - SDK for building low-latency, scalable web crawlers on [Apache Storm](http://storm.apache.org/).
+* [StormCrawler](http://stormcrawler.net/) - SDK for building low-latency and scalable web crawlers.

 ## Web Frameworks