From 7456fba0b4cad671acdf02acd12c0022636ff8b3 Mon Sep 17 00:00:00 2001 From: Andreas Kull Date: Thu, 31 May 2018 12:18:16 +0200 Subject: [PATCH] Add webmagic --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 68f4089..a3c2814 100644 --- a/README.md +++ b/README.md @@ -958,6 +958,7 @@ A curated list of awesome Java frameworks, libraries and software. * [Crawler4j](https://github.com/yasserg/crawler4j) - Simple and lightweight web crawler. * [jsoup](https://jsoup.org/) - Scrapes, parses, manipulates and cleans HTML. * [StormCrawler](http://stormcrawler.net/) - SDK for building low-latency and scalable web crawlers. +* [webmagic](https://github.com/code4craft/webmagic) - Scalable crawler with downloading, url management, content extraction and persistent. ## Web Frameworks