Apache Beam
Apache Beam provides an advanced unified programming model to implement batch and streaming data processing jobs.
21 Alternatives To Apache Beam
Amazon EMR
Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.
Apache Kafka
Apache Kafka is an open-source message broker, written in Scala and developed by the Apache Software Foundation.
Apache Spark
Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
Apache Spark for Azure HDInsight
This article provides an introduction to Spark in HDInsight and the different scenarios in which you can use a Spark cluster in HDInsight.
Azure Data Lake Store
Azure Data Lake Storage Gen2 is highly scalable and secure storage for big data analytics. Optimize costs and maximize efficiency through full integration with other Azure products.
Azure HDInsight
Azure HDInsight is a managed Apache Hadoop cloud service that lets you run Apache Spark, Apache Hive, Apache Kafka, Apache HBase, and more.
Confluent
Confluent offers a real-time data platform built around Apache Kafka.
Databricks
Databricks provides a Unified Analytics Platform that accelerates innovation by unifying data science, engineering and business.
Gearman
Gearman provides a generic application framework to farm out work to other machines or processes that are better suited to do the work.
Google Cloud Dataflow
Google Cloud Dataflow is a fully-managed cloud service and programming model for batch and streaming big data processing.
Google Cloud Dataproc
Google Cloud Dataproc is a managed Apache Spark and Apache Hadoop service that is fast, easy to use, and low cost.
Hadoop HDFS
HDFS is a distributed file system that makes it possible to scale a single Apache Hadoop cluster to hundreds (and even thousands) of nodes.
Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly…
Qubole
Qubole delivers a self-service platform for big data analytics built on the Amazon, Microsoft and Google clouds.