Apache Spark for Azure HDInsight
This article provides an introduction to Spark in HDInsight and the different scenarios in which you can use Spark cluster in HDInsight.
22 Alternatives To Apache Spark for Azure HDInsight
ASG Enterprise Data Intelligence
Enterprise Data Intelligence drives business with data lineage.
Adabas SOA Gateway
The Software AG TECHcommunity is the one-stop online global user community portal for Software AG leading product brands: webMethods, Adabas-Natural, ARIS, Terracotta, Apama and Alfabet .
Amazon EMR
Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.
Apache Ambari
Ambari is aimed at making Hadoop management simpler by developing software for provisioning, managing, and monitoring Hadoop clusters.
Apache Apex
Apache Apex is an enterprise-grade unified stream and batch processing engine.
Apache Beam
Apache Beam provides an advanced unified programming model to implement batch and streaming data processing jobs.
Azure Data Lake Store
Azure Data Lake Storage Gen2 is highly scalable and secure storage for big data analytics. Maximize costs and efficiency through full integrations with other Azure products.
Azure HDInsight
Azure HDInsight is a managed Apache Hadoop cloud service that lets you run Apache Spark, Apache Hive, Apache Kafka, Apache HBase, and more.
Cloud Dataprep
Cloud Dataprep by Trifacta is a data prep & cleansing service for exploring, cleaning & preparing datasets using a simple drag & drop browser environment
Databricks
Databricks provides a Unified Analytics Platform that accelerates innovation by unifying data science, engineering and business.What is Apache Spark?
Google Cloud Dataflow
Google Cloud Dataflow is a fully-managed cloud service and programming model for batch and streaming big data processing.
Google Cloud Dataproc
Managed Apache Spark and Apache Hadoop service which is fast, easy to use, and low cost
HVR
Your data. Where you need it. HVR is the leading independent real-time data replication solution that offers efficient data integration for cloud and more.
Hadoop HDFS
The Apache HDFS is a distributed file system that makes it possible to scale a single Apache Hadoop cluster to hundreds (and even thousands) of nodes.
HortonWorks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly…
MapR
MapR is a leading high-performance data management or IT management solution that integrates Apache Drill, Hadoop and Spark with real-time global event streaming, scalable enterprise storage, and database capabilities in order to control large appli…
Oracle Big Data
Oracle Big Data offers solutions to help organize and analyze diverse data sources alongside existing data.
Qubole
Qubole delivers a self-service platform for big aata analytics built on Amazon, Microsoft and Google Clouds.
ZeroFOX
ZeroFOX is a social risk management tool that enables organizations to identify, manage and mitigate social media based cyber threats.
Zettaset Fast-PATH
Speeds Hadoop deployments, improves operational efficiencies, and significantly reduces costly IT support and maintenance requirements MOUNTAIN VIEW, Calif.