Apache Spark

Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.

20 Alternatives To Apache Spark

Amazon Athena

Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.
images/2020/04/Amazon-EMR.png}}

Amazon EMR

Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.

Amazon Redshift

Learn about Amazon Redshift cloud data warehouse.
images/2020/04/Apache-Beam.png}}

Apache Beam

Apache Beam provides an advanced unified programming model to implement batch and streaming data processing jobs.

Apache Hive

Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.

Confluent

Confluent offers a real-time data platform built around Apache Kafka.

Google Cloud Dataflow

Google Cloud Dataflow is a fully-managed cloud service and programming model for batch and streaming big data processing.

Hortonworks

Hadoop-Related
images/2020/04/IBM-Analytics-Engine.png}}

IBM Analytics Engine

Analytics Engine is a combined Apache Spark and Apache Hadoop service for creating analytics applications.

MapR

MapR is a leading high-performance data management or IT management solution that integrates Apache Drill, Hadoop and Spark with real-time global event streaming, scalable enterprise storage, and database capabilities in order to control large appli…

Oracle Database 12c

Simplify database management and automate the information lifecycle with maximum security.

Sequel Pro

MySQL database management for Mac OS X