Apache Nutch
Apache Nutch is a highly extensible and scalable open source web crawler software project.
17 Alternatives To Apache Nutch
80legs
Custom Web Scraping & Powerful Web Crawling.
ACHE Crawler
ACHE is a web crawler for domain-specific search.
Algolia
Algolia’s Search API makes it easy to deliver a great search experience in your apps & websites. Algolia Search provides hosted full-text, numerical, faceted and geolocalized search.
Apify
Apify is a web scraping and automation platform that can turn any website into an API.
Conseris
Data Starts Here. Gather and explore your data from anywhere on the planet.
Content Grabber
Content Grabber is an automated web scraping tool.
Enketo Smart Paper
Web forms evolved. Deploy and conduct surveys that work without a connection, on any device.
Heritrix
Heritrix is the Internet Archive’s open-source, extensible, web-scale, archival-quality web…
OnCrawl
OnCrawl is a SEO crawler and log analyzer that helps you improve your SEO performance. Increase traffic, rankings & revenues. Start your 14-day free trial.
Open Data Kit
The Open Data Kit community produces free and open-source software for collecting, managing, and using data in resource-constrained environments.
ParseHub
ParseHub is a free web scraping tool. With our advanced web scraper, extracting data is as easy as clicking the data you need.
ProxyCrawl
ProxyCrawl stay anonymous while crawling the web. Avoid captchas, blocks and proxies. Crawling and scraping protection
ScrapeHero
A web scraping service to collect data from websites, without any programming or DIY tools.
Scrapy
Scrapy | A Fast and Powerful Scraping and Web Crawling Framework
Scrapy Cloud
A cloud-based web crawling platform, allows you to easily deploy crawlers and scale them on demand.
SurveyCTO
Collect data you can trust Offline or online, in the field, on the street, or in the lab.
Swiftype
The simplest way to add search to your website or application. Sign up for free.