Cohere builds powerful models and AI solutions enabling enterprises to automate processes, empower employees, and turn fragmented data into actionable insights.
AI Inference service catalog
Showing AI Inference services from 8,000+ services. Search within this category, or choose another category or company below.
AI-powered marketing copy generator for emails, ads, blogs, and social posts. Used by 10M+ users and 4,000+ teams.
Specialized cloud provider offering massive GPU infrastructure for AI training and inference. NVIDIA Preferred CSP with 45,000+ GPUs.
Unified data and AI platform built on Apache Spark. Powers the data lakehouse with Delta Lake, MLflow, and Mosaic AI.
AI-powered speech recognition API providing fast, accurate transcription and voice AI capabilities.
深度求索(DeepSeek),成立于2023年,专注于研究世界领先的通用人工智能底层模型与技术,挑战人工智能前沿性难题。基于自研训练框架、自建智算集群和万卡算力等资源,深度求索团队仅用半年时间便已发布并开源多个百亿级参数大模型,如DeepSeek-LLM通用大语言模型、DeepSeek-Coder代码大模型,并在2024年1月率先开源国内首个MoE大模型(DeepSeek-MoE),各大模型在公开评测榜单及真实样本外的泛化效果均有超越同级别模型的出色表现。和 DeepSeek AI 对话,轻松接入 API。
Duck.ai is DuckDuckGo's privacy-focused AI chat service that provides anonymous access to multiple large language models including GPT-4o, Claude, and Llama without saving chat history or sharing data with AI providers. Users can have conversations with AI assistants while maintaining full privacy, with no account required.
Serverless AI inference platform for running diffusion models and other ML models at scale with fast response times.
High-performance AI inference platform optimized for speed and cost — serving Llama, Mixtral, and custom fine-tuned models at scale.
Google's family of open-weight large language models built from the same research as Gemini, available via API.
Google's suite of cloud computing services running on the same infrastructure that Google uses for its end-user products.
Google's most capable multimodal AI model family, available through Google AI Studio and Vertex AI.
AI platform for government and enterprise knowledge management.
AI writing assistant with grammar correction, tone detection, and generative AI writing. 30M+ daily active users across web, desktop, and IDE plugins.
AI acceleration company developing Language Processing Units (LPUs) for ultra-fast inference of large language models.
Open-source LLM observability platform for monitoring, debugging, and optimizing AI applications and API usage.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
The AI community platform for sharing models, datasets, and Spaces. Hub for 500,000+ open-source ML models.
Platform for building AI features faster with prompt management, evaluations, and fine-tuning workflows.
IBM Cloud with Red Hat offers market-leading security, enterprise scalability and open innovation to unlock the full potential of cloud and AI.
IBM's research division developing the Granite series of enterprise-grade open-source language models.
IBM's AI and data platform for enterprises providing foundation models, data management, and AI governance tools.
AI startup developing Mercury diffusion language models optimized for fast, cost-efficient text generation.
AI company developing the Inflection 3 series of models for enterprise productivity and conversational AI applications.
Showing 25–48 of 105 services