Apache Parquet

Apache Parquet is a columnar storage format available to any project in the Hadoop ecosystem.

Amazon Athena

Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.

Amazon EMR

Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.

Amazon Redshift

Learn about Amazon Redshift cloud data warehouse.

Apache Hive

Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.

Apache Kudu

Apache Kudu is Hadoop’s storage layer to enable fast analytics on fast data.

Apache Spark

Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.

BlueData

BlueData’s software platform makes it easier, faster and more cost-effective for organizations to deploy Big Data infrastructure on-premises.

Hortonworks

Hadoop-Related

Impala

Impala is a modern, open source, distributed SQL query engine for Apache Hadoop.

Looker

Looker makes it easy for analysts to create and curate custom data experiences—so everyone in the business can explore the data that matters to them, in the context that makes it truly meaningful.

Qubole

Qubole delivers a self-service platform for big aata analytics built on Amazon, Microsoft and Google Clouds.

RJ Metrics

RJMetrics provides hosted business intelligence & data analysis software to companies that operate online.

Talend

Talend Cloud delivers a single, open platform for data integration across cloud and on-premises environments. Put more data to work for your business faster with Talend.

Vertica

Vertica is a grid-based, column-oriented database designed to manage large, fast-growing volumes of…

22 Alternatives To Apache Parquet