Offering
ERNIE 4.5 21B A3B Thinking
Offering detailsContext Window: 131072 tokens; Input Modalities: text
APIs for running AI and machine learning model inference.
50 offerings on this page with service context, pricing, regions, and links.
Offering
ERNIE 4.5 21B A3B Thinking
Offering detailsContext Window: 131072 tokens; Input Modalities: text
Offering
ERNIE 4.5 300B A47B
Offering detailsContext Window: 123000 tokens; Input Modalities: text
Offering
ERNIE 4.5 VL 28B A3B
Offering detailsContext Window: 30000 tokens; Input Modalities: text, image
Offering
ERNIE 4.5 VL 424B A47B
Offering detailsContext Window: 123000 tokens; Input Modalities: image, text
Offering
Baidu ERNIE Bot & Qianfan API
Offering detailsChinese Language Excellence: true; Knowledge Grounding: true; +2 more
Offering
ERNIE Speed
Offering detailsLanguages: Chinese, English; Context: 128K tokens; +1 more
Offering
Baseten Model Inference
Offering detailsTruss Packaging: Open-source model packaging; GPU Support: A100, H100, T4, A10G; +3 more
Offering
BentoML Inference Platform
Offering detailsFramework Agnostic: Any ML framework; Adaptive Batching: Dynamic request batching; +3 more
Offering
Braintrust AI Proxy & Inference
Offering detailsMulti-Provider Routing: OpenAI, Anthropic, Google, Azure; Semantic Caching: Deduplicate similar requests; +2 more
Offering
Brave Leo AI
Offering detailsPrivacy: No conversation logging; Multiple Models: Llama 3, Mistral, Claude; +3 more
Offering
Brave Search AI Summarizer
Offering detailsPrivacy-First AI: No profile building; Cited Answers: Source attribution; +2 more
Offering
Seed 1.6
Offering detailsContext Window: 262144 tokens; Input Modalities: image, text, video
Offering
Seed 1.6 Flash
Offering detailsContext Window: 262144 tokens; Input Modalities: image, text, video
Offering
Seed-2.0-Lite
Offering detailsContext Window: 262144 tokens; Input Modalities: text, image, video
Offering
Seed-2.0-Mini
Offering detailsContext Window: 262144 tokens; Input Modalities: text, image, video
Offering
C3 Generative AI
Offering detailsEnterprise RAG: Retrieval over proprietary data; LLM Agnostic: OpenAI, Azure, custom models; +2 more
Offering
Cartesia Sonic TTS Inference
Offering detailsLatency: <100ms time-to-first-audio; Streaming: WebSocket real-time; +3 more
Offering
Cerebras Inference API
Offering detailsSpeed: 2,000+ tokens/second; API Compatibility: OpenAI-compatible; +3 more
Offering
Character.AI Character Platform
Offering detailsCharacter Creation: true; Community Characters: 18M+ characters; +2 more
Offering
ChatGPT Consumer & Plus
Offering detailsGPT-4o Access: true; Memory: true; +3 more
Offering
ChatGPT Enterprise
Offering detailsData Privacy: No training on data; Unlimited GPT-4o: true; +3 more
Offering
Clarifai AI Inference Platform
Offering detailsModel Marketplace: 1,500+ models; Fine-Tuning: Custom training; +3 more
Offering
Claude API by Anthropic
Offering detailsContext Window: 200K tokens; Tool Use: true; +3 more
Offering
Claude API - Standard
Offering detailsContext Window: 200K tokens; Tool Use: Yes; +2 more
Offering
Claude Batch API
Offering detailsCost Savings: 50% off; Processing Window: 24 hours; +2 more
Offering
Claude for Enterprise
Offering detailsSAML SSO: true; Audit Logs: true; +3 more
Offering
Cloudflare Workers AI
Offering detailsEdge Inference: 300+ locations; Model Catalog: 100+ models; +3 more
Offering
Codeium Enterprise AI
Offering detailsOn-premise Models: Air-gapped deployment; Codebase Indexing: RAG over your repos; +3 more
Offering
Cohere Enterprise NLP API
Offering detailsCommand R+ RAG: true; Embed v3: true; +3 more
Offering
Command A
Offering detailsContext Window: 256000 tokens; Input Modalities: text
Offering
Command A
Offering detailsContext Window: 256000 tokens; Input Modalities: text
Offering
Command A Reasoning
Offering detailsContext Window: 256000 tokens; Input Modalities: text
Offering
Command R
Offering detailsContext Window: 128000 tokens; Input Modalities: text
Offering
Command R+
Offering detailsContext Window: 128000 tokens; Input Modalities: text
Offering
Command R+ (08-2024)
Offering detailsContext Window: 128000 tokens; Input Modalities: text
Offering
Command R7B
Offering detailsContext Window: 128000 tokens; Input Modalities: text
Offering
Copy.ai Marketing Content Platform
Offering detailsMarketing Workflows: 100+; Brand Voice: true; +2 more
Offering
CoreWeave AI Inference
Offering detailsGPU Types: H100, A100, L40S, A40; Scaling: Kubernetes auto-scaling; +3 more
Offering
Coze Bot API
Offering detailsModels: GPT-4o, Claude, Gemini, Doubao; Plugins: 900+; +2 more
Offering
CrewAI Enterprise
Offering detailsManaged Execution: true; Monitoring: true; +2 more
Offering
Cursor AI Models
Offering detailsModels: Claude 3.7, GPT-4o, Gemini 1.5; cursor-small: Fast + cheap; +2 more
Offering
Cursor AI Completions
Offering detailsMulti-line Completion: true; Context: Full codebase; +2 more
Offering
d-Matrix Inference Platform
Offering detailsModel Support: Llama, Mistral, GPT-J+; Compiler: Automatic optimization; +2 more
Offering
Daily AI (Real-time Voice AI)
Offering detailsLatency: <300ms end-to-end; Model Agnostic: Any STT/LLM/TTS; +2 more
Offering
Databricks Model Serving
Offering detailsModel Support: Custom + foundation models; Auto-Scaling: Scale to zero; +3 more
Offering
DataRobot MLOps & Deployment
Offering detailsAuto-ML: true; Champion/Challenger: true; +2 more
Offering
Deepgram AI Speech API
Offering detailsAccuracy: Industry-leading WER; Speed: Up to 50x real-time; +3 more
Offering
DeepSeek Chat
Offering detailsContext Window: 128000 tokens; Input Modalities: text
Offering
DeepSeek V3 0324
Offering detailsContext Window: 163840 tokens; Input Modalities: text
Offering
DeepSeek V3.1
Offering detailsContext Window: 32768 tokens; Input Modalities: text
| Service | Offering | Pricing model | Starting price | Regions | Features | Links |
|---|---|---|---|---|---|---|
BaiduService details | ERNIE 4.5 21B A3B Thinking Offering details | Pay-as-you-go | $0.070 1M input tokens | 0 | Context Window: 131072 tokens; Input Modalities: text | |
BaiduService details | ERNIE 4.5 300B A47B Offering details | Pay-as-you-go | $0.280 1M input tokens | 0 | Context Window: 123000 tokens; Input Modalities: text | |
BaiduService details | ERNIE 4.5 VL 28B A3B Offering details | Pay-as-you-go | $0.140 1M input tokens | 0 | Context Window: 30000 tokens; Input Modalities: text, image | |
BaiduService details | ERNIE 4.5 VL 424B A47B Offering details | Pay-as-you-go | $0.420 1M input tokens | 0 | Context Window: 123000 tokens; Input Modalities: image, text | |
Baidu ERNIEService details | Baidu ERNIE Bot & Qianfan API Offering details | Usage-based | $0.0012 per 1K tokens (CNY) | 2 | Chinese Language Excellence: true; Knowledge Grounding: true; +2 more | |
Baidu ERNIEService details | ERNIE Speed Offering details | Usage-based | Free | 2 | Languages: Chinese, English; Context: 128K tokens; +1 more | |
BasetenService details | Baseten Model Inference Offering details | Usage-based | Free | 3 | Truss Packaging: Open-source model packaging; GPU Support: A100, H100, T4, A10G; +3 more | |
BentoMLService details | BentoML Inference Platform Offering details | Freemium | Free | 4 | Framework Agnostic: Any ML framework; Adaptive Batching: Dynamic request batching; +3 more | |
BraintrustService details | Braintrust AI Proxy & Inference Offering details | Freemium | Free | 1 | Multi-Provider Routing: OpenAI, Anthropic, Google, Azure; Semantic Caching: Deduplicate similar requests; +2 more | |
Brave BrowserService details | Brave Leo AI Offering details | Freemium | Free | 1 | Privacy: No conversation logging; Multiple Models: Llama 3, Mistral, Claude; +3 more | |
Brave SearchService details | Brave Search AI Summarizer Offering details | Freemium | Free | 1 | Privacy-First AI: No profile building; Cited Answers: Source attribution; +2 more | |
ByteDanceService details | Seed 1.6 Offering details | Pay-as-you-go | $0.250 1M input tokens | 0 | Context Window: 262144 tokens; Input Modalities: image, text, video | |
ByteDanceService details | Seed 1.6 Flash Offering details | Pay-as-you-go | $0.075 1M input tokens | 0 | Context Window: 262144 tokens; Input Modalities: image, text, video | |
ByteDanceService details | Seed-2.0-Lite Offering details | Pay-as-you-go | $0.250 1M input tokens | 0 | Context Window: 262144 tokens; Input Modalities: text, image, video | |
ByteDanceService details | Seed-2.0-Mini Offering details | Pay-as-you-go | $0.100 1M input tokens | 0 | Context Window: 262144 tokens; Input Modalities: text, image, video | |
C3.aiService details | C3 Generative AI Offering details | Enterprise | Free | 2 | Enterprise RAG: Retrieval over proprietary data; LLM Agnostic: OpenAI, Azure, custom models; +2 more | |
CartesiaService details | Cartesia Sonic TTS Inference Offering details | Usage-based | Free | 1 | Latency: <100ms time-to-first-audio; Streaming: WebSocket real-time; +3 more | |
Cerebras SystemsService details | Cerebras Inference API Offering details | Usage-based | Free | 1 | Speed: 2,000+ tokens/second; API Compatibility: OpenAI-compatible; +3 more | |
Character.AIService details | Character.AI Character Platform Offering details | Freemium | Free | 1 | Character Creation: true; Community Characters: 18M+ characters; +2 more | |
ChatGPTService details | ChatGPT Consumer & Plus Offering details | Freemium | Free | 1 | GPT-4o Access: true; Memory: true; +3 more | |
ChatGPT EnterpriseService details | ChatGPT Enterprise Offering details | Enterprise | Free | 1 | Data Privacy: No training on data; Unlimited GPT-4o: true; +3 more | |
ClarifaiService details | Clarifai AI Inference Platform Offering details | Usage-based | Free | 1 | Model Marketplace: 1,500+ models; Fine-Tuning: Custom training; +3 more | |
Claude APIService details | Claude API by Anthropic Offering details | Usage-based | $0.0003 per 1K input tokens | 1 | Context Window: 200K tokens; Tool Use: true; +3 more | |
Claude APIService details | Claude API - Standard Offering details | Usage-based | $0.003 per 1K input tokens | 3 | Context Window: 200K tokens; Tool Use: Yes; +2 more | |
Claude APIService details | Claude Batch API Offering details | Usage-based | $0.0002 per 1K input tokens | 2 | Cost Savings: 50% off; Processing Window: 24 hours; +2 more | |
Claude for EnterpriseService details | Claude for Enterprise Offering details | Enterprise | Free | 1 | SAML SSO: true; Audit Logs: true; +3 more | |
CloudflareService details | Cloudflare Workers AI Offering details | Usage-based | Free | 1 | Edge Inference: 300+ locations; Model Catalog: 100+ models; +3 more | |
CodeiumService details | Codeium Enterprise AI Offering details | Enterprise | Free | 1 | On-premise Models: Air-gapped deployment; Codebase Indexing: RAG over your repos; +3 more | |
CohereService details | Cohere Enterprise NLP API Offering details | Usage-based | Free | 3 | Command R+ RAG: true; Embed v3: true; +3 more | |
CohereService details | Command A Offering details | Pay-as-you-go | $2.50 1M input tokens | 0 | Context Window: 256000 tokens; Input Modalities: text | |
CohereService details | Command A Offering details | Pay-as-you-go | $2.50 1M input tokens | 0 | Context Window: 256000 tokens; Input Modalities: text | |
CohereService details | Command A Reasoning Offering details | Custom | — | 0 | Context Window: 256000 tokens; Input Modalities: text | |
CohereService details | Command R Offering details | Pay-as-you-go | $0.150 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text | |
CohereService details | Command R+ Offering details | Pay-as-you-go | $2.50 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text | |
CohereService details | Command R+ (08-2024) Offering details | Pay-as-you-go | $2.50 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text | |
CohereService details | Command R7B Offering details | Pay-as-you-go | $0.0375 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text | |
Copy.aiService details | Copy.ai Marketing Content Platform Offering details | Freemium | Free | 1 | Marketing Workflows: 100+; Brand Voice: true; +2 more | |
CoreWeaveService details | CoreWeave AI Inference Offering details | Usage-based | $2.21 per GPU hour | 3 | GPU Types: H100, A100, L40S, A40; Scaling: Kubernetes auto-scaling; +3 more | |
CozeService details | Coze Bot API Offering details | Freemium | Free | 1 | Models: GPT-4o, Claude, Gemini, Doubao; Plugins: 900+; +2 more | |
CrewAIService details | CrewAI Enterprise Offering details | Subscription | Free | 1 | Managed Execution: true; Monitoring: true; +2 more | |
CursorService details | Cursor AI Models Offering details | Subscription | Free | 1 | Models: Claude 3.7, GPT-4o, Gemini 1.5; cursor-small: Fast + cheap; +2 more | |
Cursor AIService details | Cursor AI Completions Offering details | Subscription | Free | 1 | Multi-line Completion: true; Context: Full codebase; +2 more | |
d-MatrixService details | d-Matrix Inference Platform Offering details | Subscription | Free | 1 | Model Support: Llama, Mistral, GPT-J+; Compiler: Automatic optimization; +2 more | |
Daily.coService details | Daily AI (Real-time Voice AI) Offering details | Usage-based | Free | 1 | Latency: <300ms end-to-end; Model Agnostic: Any STT/LLM/TTS; +2 more | |
DatabricksService details | Databricks Model Serving Offering details | Usage-based | $0.070 per DBU hour | 4 | Model Support: Custom + foundation models; Auto-Scaling: Scale to zero; +3 more | |
DataRobotService details | DataRobot MLOps & Deployment Offering details | Subscription | Free | 3 | Auto-ML: true; Champion/Challenger: true; +2 more | |
DeepgramService details | Deepgram AI Speech API Offering details | Usage-based | Free | 1 | Accuracy: Industry-leading WER; Speed: Up to 50x real-time; +3 more | |
DeepSeekService details | DeepSeek Chat Offering details | Pay-as-you-go | $0.270 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text | |
DeepSeekService details | DeepSeek V3 0324 Offering details | Pay-as-you-go | $0.200 1M input tokens | 0 | Context Window: 163840 tokens; Input Modalities: text | |
DeepSeekService details | DeepSeek V3.1 Offering details | Pay-as-you-go | $0.150 1M input tokens | 0 | Context Window: 32768 tokens; Input Modalities: text |
Showing 51–100 of 515 offerings