Apache Airflow is an open-source platform to programmatically author, schedule, and monitor data pipelines as directed acyclic graphs (DAGs). Originally created at Airbnb in 2014 and donated to the ASF, it is the de facto standard for batch workflow orchestration with a rich ecosystem of providers for cloud services, databases, and SaaS systems.
Apache Software Foundation service catalog
Showing Apache Software Foundation services from 8,000+ services. Search within this company view, or use the filters below to narrow by category.
Open-source distributed NoSQL database for handling large amounts of data across many servers with high availability.
Open-source document-oriented database with multi-master sync and HTTP API access.
Apache Druid is an open-source real-time analytics database designed for fast slice-and-dice analytics on large datasets. It combines the strengths of a time-series database, a search system, and a column-oriented data store, making it suitable for sub-second OLAP queries on streaming data ingestion.
Apache ECharts is an open-source JavaScript data visualization library that provides interactive charts, graphs, and maps for web applications. It supports over 20 chart types including line, bar, scatter, pie, candlestick, and geographic maps, with built-in support for large datasets and dynamic data streaming.
Apache HTTP Server (httpd) is the open-source cross-platform web server software developed under the Apache Software Foundation. Since 1995 it has been one of the most widely deployed web servers, supporting modules for SSL/TLS, URL rewriting, virtual hosts, reverse proxying, and dozens of authentication backends. Apache pioneered the modular plug-in model that influenced nearly every modern HTTP server.
Apache Kafka is an open-source distributed event streaming platform used for high-throughput, low-latency data pipelines, streaming analytics, and event-driven architectures. Originally developed at LinkedIn and contributed to the ASF in 2011, Kafka is the backbone of streaming data infrastructure at thousands of companies and underlies many managed services like Confluent Cloud and AWS MSK.
Open-source data integration platform providing automated, scalable data flows between systems with a web UI.
Apache OpenOffice is a free and open-source desktop office productivity suite including word processing, spreadsheet, presentation, drawing, and database applications.
Apache Pinot is a real-time distributed OLAP datastore built to deliver scalable real-time analytics with low latency on streaming and batch data. Originally developed at LinkedIn and donated to the ASF, Pinot powers user-facing analytics on platforms with billions of events per day, including LinkedIn, Uber, Stripe, and Walmart.
Apache Pulsar is a cloud-native distributed messaging and streaming platform combining pub-sub messaging, queuing, and event streaming in a single unified system. Originally developed at Yahoo and donated to the ASF in 2016, Pulsar separates the serving layer (brokers) from the storage layer (Apache BookKeeper), enabling independent scaling, multi-tenancy, geo-replication, and tiered storage to object stores.
Open-source enterprise search platform built on Apache Lucene providing full-text search and near real-time indexing.
Apache Spark is an open-source unified analytics engine for large-scale data processing with built-in modules for SQL, streaming, machine learning, and graph processing. Created at UC Berkeley in 2009, Spark replaced Hadoop MapReduce as the dominant batch processing engine and underpins commercial platforms including Databricks, AWS EMR, and Google Dataproc.
Apache Superset is an open-source modern data exploration and visualization platform offering a no-code interface for chart building, a SQL editor, and an extensible visualization plug-in system. Originally created at Airbnb and donated to the ASF, Superset is widely deployed as a self-hostable BI alternative and is the foundation of the commercial Preset Cloud offering.