Apache Spark

Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.

20 Alternatives To Apache Spark

AWS Glue

Fully managed extract, transform, and load (ETL) service

Amazon Athena

Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.

Amazon EMR

Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.

Amazon Redshift

Amazon Redshift is a cloud data warehouse.

Apache Beam

Apache Beam provides an advanced unified programming model to implement batch and streaming data processing jobs.

Apache Druid

Fast column-oriented distributed data store

Apache Hive

Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.

Apache Kylin

OLAP Engine for Big Data

Confluent

Confluent offers a real-time data platform built around Apache Kafka.

Datameer

Datameer is a business-user-focused business intelligence (BI) platform for Hadoop.

Google Cloud Dataflow

Google Cloud Dataflow is a fully managed cloud service and programming model for batch and streaming big data processing.

Hadoop

Open-source software for reliable, scalable, distributed computing

Hortonworks

Vendor of an enterprise Apache Hadoop distribution

IBM Analytics Engine

Analytics Engine is a combined Apache Spark and Apache Hadoop service for creating analytics applications.

MapR

MapR is a data management platform that integrates Apache Drill, Hadoop and Spark with real-time global event streaming, scalable enterprise storage, and database capabilities in order to control large appli…

MySQL

The world’s most popular open source database

Oracle Database 12c

Simplify database management and automate the information lifecycle with maximum security.

Presto DB

Distributed SQL Query Engine for Big Data (by Facebook)

Sequel Pro

MySQL database management for Mac OS X