Service
Replicate
Offering
Replicate Open-Source AI Model API
Review provider-published pricing references for AI Inference. Rows stay unsorted by price when billing units do not match.
Mixed units
Providers publish these rows in different billing units, so losclouds keeps the original labels and falls back to alphabetical order.
50 offerings on this page, listed alphabetically because starting-price units are mixed.
Service
Replicate
Offering
Replicate Open-Source AI Model API
Service
Salesforce
Offering
Salesforce Einstein AI
Service
SAP Business Technology Platform
Offering
SAP AI Core
Service
Stability AI API
Offering
Stability AI API - Stable Diffusion
Service
Tencent Cloud AI
Offering
Tencent Cloud AI — NLP, Vision & LLM APIs
Service
Tenstorrent
Offering
Tenstorrent AI Inference Cloud
Service
Together AI
Offering
Together AI — Open-Source Model Inference Platform
Service
Upstage
Offering
Upstage Solar API — Enterprise LLM & Document AI
Service
Writer
Offering
Writer — Full-Stack Enterprise Generative AI
| Service | Offering | Model | Starting Price↕ | Details |
|---|---|---|---|---|
| Replicate | Replicate Open-Source AI Model API | usage-based | $0.000225per second CPU (varies by model) | Pricing |
| Resemble AI | Resemble AI Voice API | usage-based | $0.006per second of audio | Pricing |
| RunPod | RunPod Serverless AI Inference | usage-based | $0.00022per second (RTX 3090) | Pricing |
| Runway Research | Runway Research API | usage-based | $0.05per generation credit | Pricing |
| Salesforce | Salesforce Einstein AI | subscription | $50per user/month (Einstein for Sales) | Pricing |
| SambaNova Systems | SambaNova Cloud AI Inference | usage-based | $0.0005per 1K tokens | Pricing |
| SAP Business Technology Platform | SAP AI Core | usage-based | $0.02per capability unit hour | Pricing |
| Scale AI | Scale Generative AI Platform | usage-based | Free | Pricing |
| Snorkel AI | Snorkel Flow Model Training | subscription | Free | Pricing |
| Snowflake | Snowflake Cortex AI | usage-based | $0.04per 1M tokens (Llama 3 8B) | Pricing |
| Sora | Sora Video Generation API | subscription | $20per month (ChatGPT Plus) | Pricing |
| Stability AI | Stability AI Inference API | usage-based | $0.065per image (SD3.5) | Pricing |
| Stability AI API | Stability AI API - Stable Diffusion | usage-based | $0.003per image (SDXL 512px) | Pricing |
| StepFun | Step 3.5 Flash | pay-as-you-go | $0.11M input tokens | Pricing |
| StepFun | Step 3.5 Flash | free | — | Pricing |
| Tabnine | Tabnine AI Code Inference | freemium | Free | Pricing |
| Tencent | Hunyuan A13B Instruct | pay-as-you-go | $0.141M input tokens | Pricing |
| Tencent Cloud AI | Tencent Cloud AI — NLP, Vision & LLM APIs | usage-based | $0.0008per 1000 tokens (Hunyuan Lite) | Pricing |
| Tenstorrent | Tenstorrent AI Inference Cloud | usage-based | $0.5per hour (Wormhole card) | Pricing |
| Together AI | DeepSeek V3.1 on Together AI | pay-as-you-go | $0.61M input tokens | Pricing |
| Together AI | GLM-5 on Together AI | pay-as-you-go | $11M input tokens | Pricing |
| Together AI | GPT-OSS 120B on Together AI | pay-as-you-go | $0.151M input tokens | Pricing |
| Together AI | GPT-OSS 20B on Together AI | pay-as-you-go | $0.051M input tokens | Pricing |
| Together AI | Kimi K2.5 on Together AI | pay-as-you-go | $0.51M input tokens | Pricing |
| Together AI | Llama 4 Maverick on Together AI | pay-as-you-go | $0.271M input tokens | Pricing |
| Together AI | Qwen3 Coder 480B on Together AI | pay-as-you-go | $21M input tokens | Pricing |
| Together AI | Qwen3.5 397B on Together AI | pay-as-you-go | $0.61M input tokens | Pricing |
| Together AI | Together AI — Open-Source Model Inference Platform | usage-based | $0.0001per 1M tokens (Llama 3.2 8B) | Pricing |
| TSMC | TSMC AI Chip Fabrication | custom | $20000per wafer (N5 process) | Pricing |
| Unsloth | Unsloth AI Inference Optimization | freemium | Free | Pricing |
| Unsloth | Unsloth Enterprise | subscription | Free | Pricing |
| Upstage | Upstage Solar API — Enterprise LLM & Document AI | usage-based | $0.0001per 1K tokens (Solar Mini) | Pricing |
| Vercel | Vercel AI SDK & Inference | freemium | Free | Pricing |
| vLLM | vLLM — High-Throughput LLM Inference Engine | open-source | Free | Pricing |
| WhyLabs | WhyLabs LLM Monitoring | freemium | Free | Pricing |
| Writer | Palmyra X5 | pay-as-you-go | $0.61M input tokens | Pricing |
| Writer | Writer — Full-Stack Enterprise Generative AI | subscription | $18per user per month | Pricing |
| Writesonic | Writesonic — AI Writing & SEO Platform | freemium | Free | Pricing |
| xAI | Grok (xAI) | usage-based | $2e-7per input token (grok-4-1-fast) | Pricing |
| xAI | Grok 3 | pay-as-you-go | $31M input tokens | Pricing |
| xAI | Grok 3 Beta | pay-as-you-go | $31M input tokens | Pricing |
| xAI | Grok 3 Mini | pay-as-you-go | $0.31M input tokens | Pricing |
| xAI | Grok 3 Mini Beta | pay-as-you-go | $0.31M input tokens | Pricing |
| xAI | Grok 4 | pay-as-you-go | $31M input tokens | Pricing |
| xAI | Grok 4 Fast | pay-as-you-go | $0.21M input tokens | Pricing |
| xAI | Grok 4.1 Fast | pay-as-you-go | $0.21M input tokens | Pricing |
| xAI | Grok 4.20 | pay-as-you-go | $21M input tokens | Pricing |
| xAI | Grok 4.20 Beta | pay-as-you-go | $31M input tokens | Pricing |
| xAI | Grok 4.20 Multi-Agent | pay-as-you-go | $21M input tokens | Pricing |
| xAI | xAI API (Grok Models) | usage-based | $0.0002per 1K input tokens (Grok-2-Mini) | Pricing |
Showing 451–500 of 515 offerings