losclouds
Compare · AI Inference

What's on offer.

APIs for running AI and machine learning model inference.

Offerings

50 offerings on this page with service context, pricing, regions, and links.

Baidu
BaiduService details

Offering

ERNIE 4.5 21B A3B Thinking

Offering details
Pay-as-you-go$0.070 1M input tokens0 regions

Context Window: 131072 tokens; Input Modalities: text

Baidu
BaiduService details

Offering

ERNIE 4.5 300B A47B

Offering details
Pay-as-you-go$0.280 1M input tokens0 regions

Context Window: 123000 tokens; Input Modalities: text

Baidu
BaiduService details

Offering

ERNIE 4.5 VL 28B A3B

Offering details
Pay-as-you-go$0.140 1M input tokens0 regions

Context Window: 30000 tokens; Input Modalities: text, image

Baidu
BaiduService details

Offering

ERNIE 4.5 VL 424B A47B

Offering details
Pay-as-you-go$0.420 1M input tokens0 regions

Context Window: 123000 tokens; Input Modalities: image, text

Baidu ERNIE
Baidu ERNIEService details

Offering

Baidu ERNIE Bot & Qianfan API

Offering details
Usage-based$0.0012 per 1K tokens (CNY)2 regions

Chinese Language Excellence: true; Knowledge Grounding: true; +2 more

Baseten
BasetenService details

Offering

Baseten Model Inference

Offering details
Usage-basedFree3 regions

Truss Packaging: Open-source model packaging; GPU Support: A100, H100, T4, A10G; +3 more

BentoML
BentoMLService details

Offering

BentoML Inference Platform

Offering details
FreemiumFree4 regions

Framework Agnostic: Any ML framework; Adaptive Batching: Dynamic request batching; +3 more

Braintrust
BraintrustService details

Offering

Braintrust AI Proxy & Inference

Offering details
FreemiumFree1 region

Multi-Provider Routing: OpenAI, Anthropic, Google, Azure; Semantic Caching: Deduplicate similar requests; +2 more

Brave Browser
Brave BrowserService details

Offering

Brave Leo AI

Offering details
FreemiumFree1 region

Privacy: No conversation logging; Multiple Models: Llama 3, Mistral, Claude; +3 more

Brave Search
Brave SearchService details

Offering

Brave Search AI Summarizer

Offering details
FreemiumFree1 region

Privacy-First AI: No profile building; Cited Answers: Source attribution; +2 more

ByteDance
ByteDanceService details

Offering

Seed 1.6

Offering details
Pay-as-you-go$0.250 1M input tokens0 regions

Context Window: 262144 tokens; Input Modalities: image, text, video

ByteDance
ByteDanceService details

Offering

Seed 1.6 Flash

Offering details
Pay-as-you-go$0.075 1M input tokens0 regions

Context Window: 262144 tokens; Input Modalities: image, text, video

ByteDance
ByteDanceService details

Offering

Seed-2.0-Lite

Offering details
Pay-as-you-go$0.250 1M input tokens0 regions

Context Window: 262144 tokens; Input Modalities: text, image, video

ByteDance
ByteDanceService details

Offering

Seed-2.0-Mini

Offering details
Pay-as-you-go$0.100 1M input tokens0 regions

Context Window: 262144 tokens; Input Modalities: text, image, video

C3.ai
C3.aiService details

Offering

C3 Generative AI

Offering details
EnterpriseFree2 regions

Enterprise RAG: Retrieval over proprietary data; LLM Agnostic: OpenAI, Azure, custom models; +2 more

Cartesia
CartesiaService details

Offering

Cartesia Sonic TTS Inference

Offering details
Usage-basedFree1 region

Latency: <100ms time-to-first-audio; Streaming: WebSocket real-time; +3 more

Character.AI
Character.AIService details

Offering

Character.AI Character Platform

Offering details
FreemiumFree1 region

Character Creation: true; Community Characters: 18M+ characters; +2 more

ChatGPT
ChatGPTService details

Offering

ChatGPT Consumer & Plus

Offering details
FreemiumFree1 region

GPT-4o Access: true; Memory: true; +3 more

Clarifai
ClarifaiService details

Offering

Clarifai AI Inference Platform

Offering details
Usage-basedFree1 region

Model Marketplace: 1,500+ models; Fine-Tuning: Custom training; +3 more

Claude API
Claude APIService details

Offering

Claude API by Anthropic

Offering details
Usage-based$0.0003 per 1K input tokens1 region

Context Window: 200K tokens; Tool Use: true; +3 more

Claude API
Claude APIService details

Offering

Claude API - Standard

Offering details
Usage-based$0.003 per 1K input tokens3 regions

Context Window: 200K tokens; Tool Use: Yes; +2 more

Claude API
Claude APIService details

Offering

Claude Batch API

Offering details
Usage-based$0.0002 per 1K input tokens2 regions

Cost Savings: 50% off; Processing Window: 24 hours; +2 more

Cloudflare
CloudflareService details

Offering

Cloudflare Workers AI

Offering details
Usage-basedFree1 region

Edge Inference: 300+ locations; Model Catalog: 100+ models; +3 more

Codeium
CodeiumService details

Offering

Codeium Enterprise AI

Offering details
EnterpriseFree1 region

On-premise Models: Air-gapped deployment; Codebase Indexing: RAG over your repos; +3 more

Cohere
CohereService details

Offering

Cohere Enterprise NLP API

Offering details
Usage-basedFree3 regions

Command R+ RAG: true; Embed v3: true; +3 more

Cohere
CohereService details

Offering

Command A

Offering details
Pay-as-you-go$2.50 1M input tokens0 regions

Context Window: 256000 tokens; Input Modalities: text

Cohere
CohereService details

Offering

Command A

Offering details
Pay-as-you-go$2.50 1M input tokens0 regions

Context Window: 256000 tokens; Input Modalities: text

Cohere
CohereService details

Offering

Command A Reasoning

Offering details
CustomPrice pending0 regions

Context Window: 256000 tokens; Input Modalities: text

Cohere
CohereService details

Offering

Command R

Offering details
Pay-as-you-go$0.150 1M input tokens0 regions

Context Window: 128000 tokens; Input Modalities: text

Cohere
CohereService details

Offering

Command R+

Offering details
Pay-as-you-go$2.50 1M input tokens0 regions

Context Window: 128000 tokens; Input Modalities: text

Cohere
CohereService details

Offering

Command R+ (08-2024)

Offering details
Pay-as-you-go$2.50 1M input tokens0 regions

Context Window: 128000 tokens; Input Modalities: text

Cohere
CohereService details

Offering

Command R7B

Offering details
Pay-as-you-go$0.0375 1M input tokens0 regions

Context Window: 128000 tokens; Input Modalities: text

Copy.ai
Copy.aiService details

Offering

Copy.ai Marketing Content Platform

Offering details
FreemiumFree1 region

Marketing Workflows: 100+; Brand Voice: true; +2 more

CoreWeave
CoreWeaveService details

Offering

CoreWeave AI Inference

Offering details
Usage-based$2.21 per GPU hour3 regions

GPU Types: H100, A100, L40S, A40; Scaling: Kubernetes auto-scaling; +3 more

Coze
CozeService details

Offering

Coze Bot API

Offering details
FreemiumFree1 region

Models: GPT-4o, Claude, Gemini, Doubao; Plugins: 900+; +2 more

CrewAI
CrewAIService details

Offering

CrewAI Enterprise

Offering details
SubscriptionFree1 region

Managed Execution: true; Monitoring: true; +2 more

Cursor
CursorService details

Offering

Cursor AI Models

Offering details
SubscriptionFree1 region

Models: Claude 3.7, GPT-4o, Gemini 1.5; cursor-small: Fast + cheap; +2 more

Cursor AI
Cursor AIService details

Offering

Cursor AI Completions

Offering details
SubscriptionFree1 region

Multi-line Completion: true; Context: Full codebase; +2 more

d-Matrix
d-MatrixService details

Offering

d-Matrix Inference Platform

Offering details
SubscriptionFree1 region

Model Support: Llama, Mistral, GPT-J+; Compiler: Automatic optimization; +2 more

Daily.co
Daily.coService details

Offering

Daily AI (Real-time Voice AI)

Offering details
Usage-basedFree1 region

Latency: <300ms end-to-end; Model Agnostic: Any STT/LLM/TTS; +2 more

Databricks
DatabricksService details

Offering

Databricks Model Serving

Offering details
Usage-based$0.070 per DBU hour4 regions

Model Support: Custom + foundation models; Auto-Scaling: Scale to zero; +3 more

DataRobot
DataRobotService details

Offering

DataRobot MLOps & Deployment

Offering details
SubscriptionFree3 regions

Auto-ML: true; Champion/Challenger: true; +2 more

Deepgram
DeepgramService details

Offering

Deepgram AI Speech API

Offering details
Usage-basedFree1 region

Accuracy: Industry-leading WER; Speed: Up to 50x real-time; +3 more

DeepSeek
DeepSeekService details

Offering

DeepSeek Chat

Offering details
Pay-as-you-go$0.270 1M input tokens0 regions

Context Window: 128000 tokens; Input Modalities: text

DeepSeek
DeepSeekService details

Offering

DeepSeek V3 0324

Offering details
Pay-as-you-go$0.200 1M input tokens0 regions

Context Window: 163840 tokens; Input Modalities: text

DeepSeek
DeepSeekService details

Offering

DeepSeek V3.1

Offering details
Pay-as-you-go$0.150 1M input tokens0 regions

Context Window: 32768 tokens; Input Modalities: text

Showing 51–100 of 515 offerings