Apache Druid is an open-source real-time analytics database designed for fast slice-and-dice analytics on large datasets. It combines the strengths of a time-series database, a search system, and a column-oriented data store, making it suitable for sub-second OLAP queries on data ingested from streams.
Apache Software Foundation Data Warehouse service catalog
Apache Pinot is a real-time distributed OLAP datastore built to deliver scalable, low-latency analytics on streaming and batch data. Originally developed at LinkedIn and donated to the ASF, Pinot powers user-facing analytics on platforms that handle billions of events per day, including LinkedIn, Uber, Stripe, and Walmart.
Apache Spark is an open-source unified analytics engine for large-scale data processing with built-in modules for SQL, streaming, machine learning, and graph processing. Created at UC Berkeley in 2009, Spark has largely replaced Hadoop MapReduce as the dominant batch processing engine and underpins commercial platforms including Databricks, Amazon EMR, and Google Dataproc.