images/2020/04/Apache-Parquet.png}}

Apache Parquet

Apache Parquet is a columnar storage format available to any project in the Hadoop ecosystem.

22 Alternatives To Apache Parquet

AWS Glue

Fully managed extract, transform, and load (ETL) service

Amazon Athena

Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.
images/2020/04/Amazon-EMR.png}}

Amazon EMR

Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.

Amazon Redshift

Learn about Amazon Redshift cloud data warehouse.
images/2020/04/Apache-Druid.png}}

Apache Druid

Fast column-oriented distributed data store

Apache Hive

Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.

Apache Kudu

Apache Kudu is Hadoop’s storage layer to enable fast analytics on fast data.
images/2020/04/Apache-Kylin.png}}

Apache Kylin

OLAP Engine for Big Data

Apache Spark

Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.

BlueData

BlueData’s software platform makes it easier, faster and more cost-effective for organizations to deploy Big Data infrastructure on-premises.

Chartio

Chartio is a powerful business intelligence tool that anyone can use.

Hortonworks

Hadoop-Related

Impala

Impala is a modern, open source, distributed SQL query engine for Apache Hadoop.

Looker

Looker makes it easy for analysts to create and curate custom data experiences—so everyone in the business can explore the data that matters to them, in the context that makes it truly meaningful.
images/2020/04/Panoply.png}}

Panoply

Panoply is a smart cloud data warehouse

Presto DB

Distributed SQL Query Engine for Big Data (by Facebook)

Qubole

Qubole delivers a self-service platform for big aata analytics built on Amazon, Microsoft and Google Clouds.

RJ Metrics

RJMetrics provides hosted business intelligence & data analysis software to companies that operate online.
images/2020/04/Segment.png}}

Segment

We make customer data simple.

Talend

Talend Cloud delivers a single, open platform for data integration across cloud and on-premises environments. Put more data to work for your business faster with Talend.

Treasure Data

Treasure Data is an end-to-end, fully-managed cloud service for big data.

Vertica

Vertica is a grid-based, column-oriented database designed to manage large, fast-growing volumes of…