Apache Parquet
Apache Parquet is a columnar storage format available to any project in the Hadoop ecosystem.
22 Alternatives To Apache Parquet
AWS Glue
Fully managed extract, transform, and load (ETL) service
Amazon Athena
Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.
Amazon EMR
Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.
Amazon Redshift
Learn about Amazon Redshift cloud data warehouse.
Apache Druid
Fast column-oriented distributed data store
Apache Hive
Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.
Apache Kudu
Apache Kudu is Hadoop’s storage layer to enable fast analytics on fast data.
Apache Kylin
OLAP Engine for Big Data
Apache Spark
Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
BlueData
BlueData’s software platform makes it easier, faster and more cost-effective for organizations to deploy Big Data infrastructure on-premises.
Chartio
Chartio is a powerful business intelligence tool that anyone can use.
Hortonworks
Hadoop-Related
Impala
Impala is a modern, open source, distributed SQL query engine for Apache Hadoop.
Looker
Looker makes it easy for analysts to create and curate custom data experiences—so everyone in the business can explore the data that matters to them, in the context that makes it truly meaningful.
Panoply
Panoply is a smart cloud data warehouse
Presto DB
Distributed SQL Query Engine for Big Data (by Facebook)
Qubole
Qubole delivers a self-service platform for big aata analytics built on Amazon, Microsoft and Google Clouds.
RJ Metrics
RJMetrics provides hosted business intelligence & data analysis software to companies that operate online.
Segment
We make customer data simple.
Talend
Talend Cloud delivers a single, open platform for data integration across cloud and on-premises environments. Put more data to work for your business faster with Talend.
Treasure Data
Treasure Data is an end-to-end, fully-managed cloud service for big data.
Vertica
Vertica is a grid-based, column-oriented database designed to manage large, fast-growing volumes of…