Apache Spark
Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
20 Alternatives To Apache Spark
Amazon Athena
Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.
Amazon EMR
Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.
Amazon Redshift
Learn about Amazon Redshift cloud data warehouse.
Apache Beam
Apache Beam provides an advanced unified programming model to implement batch and streaming data processing jobs.
Apache Hive
Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.
Confluent
Confluent offers a real-time data platform built around Apache Kafka.
Google Cloud Dataflow
Google Cloud Dataflow is a fully-managed cloud service and programming model for batch and streaming big data processing.
Hortonworks
Hadoop-Related
IBM Analytics Engine
Analytics Engine is a combined Apache Spark and Apache Hadoop service for creating analytics applications.
MapR
MapR is a leading high-performance data management or IT management solution that integrates Apache Drill, Hadoop and Spark with real-time global event streaming, scalable enterprise storage, and database capabilities in order to control large appli…
Oracle Database 12c
Simplify database management and automate the information lifecycle with maximum security.
Sequel Pro
MySQL database management for Mac OS X