Open-source data integration platform with 350+ connectors and self-host or cloud options.
AI Data Platforms service catalog
Showing AI Data Platforms services from 8,000+ services. Search within this category, or choose another category or company below.
Comprehensive cloud platform offering over 200 services from data centers globally. Market leader in IaaS.
Apify is a web scraping and browser-automation platform offering a marketplace of pre-built actors, scheduled crawlers, and an API for turning any website into structured data.
Open-source tool for fine-tuning large language models supporting various training techniques including QLoRA and LoRA.
AI drug discovery company using machine learning to identify and develop new pharmaceutical treatments.
Enterprise AI evaluation and observability platform for logging, testing, and scoring LLM application outputs.
Full-lifecycle AI platform for computer vision, NLP, and audio processing with model training and deployment.
MLOps platform for tracking, comparing, explaining, and optimizing machine learning experiments and models.
Unified data and AI platform built on Apache Spark. Powers the data lakehouse with Delta Lake, MLflow, and Mosaic AI.
Collaborative data science and AI platform for building and deploying machine learning models.
Enterprise AI platform that automates machine learning model building, deployment, and monitoring.
Data transformation company behind dbt. Turns SQL SELECT statements into production-grade data pipelines.
Deep learning training platform providing hyperparameter tuning, distributed training, and experiment management.
Data-centric AI platform for labeling, managing, and analyzing training data for computer vision models.
MLOps platform for managing, reproducing, and deploying machine learning experiments and pipelines.
Open-source ML observability framework and platform for monitoring data quality and model performance.
Exa is a neural search API designed for AI agents, returning semantically relevant web results and structured datasets for retrieval-augmented generation pipelines.
AI observability platform providing explainability, monitoring, and fairness analysis for ML models in production.
Firecrawl is an API that turns websites into LLM-ready markdown via crawling, scraping, and structured extraction, designed for retrieval-augmented generation and AI agent workflows.
Automated data movement platform with 500+ connectors. Managed ELT pipelines for data warehouses.
Google's suite of cloud computing services running on the same infrastructure that Google uses for its end-user products.
Open-source AI platform providing AutoML, machine learning, and generative AI solutions for enterprises.
The AI community platform for sharing models, datasets, and Spaces. Hub for 500,000+ open-source ML models.
IBM's AI and data platform for enterprises providing foundation models, data management, and AI governance tools.
Showing 1–24 of 53 services