losclouds
Pricing reference rows.

Review provider-published pricing references for AI Inference. Rows stay unsorted by price when billing units do not match.

Mixed units

Starting prices are shown as references, not a cheapest ranking.

Providers publish these rows in different billing units, so losclouds keeps the original labels and falls back to alphabetical order.
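
The fallback described above can be sketched roughly as follows. This is a minimal illustration, not losclouds' actual implementation: the `order_rows` function and the `service`, `price`, and `unit` field names are hypothetical, and sorting by price when all units match is an assumption inferred from the description.

```python
def order_rows(rows):
    """Sort by price only when every row shares one billing unit; otherwise sort by name."""
    units = {row["unit"] for row in rows}
    if len(units) == 1:
        return sorted(rows, key=lambda row: row["price"])
    # Mixed units: prices are not comparable, so fall back to
    # case-insensitive alphabetical order by service name.
    return sorted(rows, key=lambda row: row["service"].casefold())


# Sample rows taken from this page (field names are hypothetical).
mixed = [
    {"service": "TSMC", "price": 20000.0, "unit": "per wafer (N5 process)"},
    {"service": "Tenstorrent", "price": 0.5, "unit": "per hour (Wormhole card)"},
    {"service": "SAP Business Technology Platform", "price": 0.02, "unit": "per capability unit hour"},
    {"service": "SambaNova Systems", "price": 0.0005, "unit": "per 1K tokens"},
]

for row in order_rows(mixed):
    print(row["service"])
```

The case-insensitive key matters for names such as TSMC and vLLM, which would otherwise sort by letter case rather than by spelling.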

Pricing comparison

50 offerings on this page, listed alphabetically because starting-price units are mixed.

| Service | Offering | Pricing model | Starting price |
| --- | --- | --- | --- |
| Replicate | Replicate Open-Source AI Model API | usage-based | $0.000225 per second CPU (varies by model) |
| Resemble AI | Resemble AI Voice API | usage-based | $0.006 per second of audio |
| RunPod | RunPod Serverless AI Inference | usage-based | $0.00022 per second (RTX 3090) |
| Runway Research | Runway Research API | usage-based | $0.05 per generation credit |
| Salesforce | Salesforce Einstein AI | subscription | $50 per user/month (Einstein for Sales) |
| SambaNova Systems | SambaNova Cloud AI Inference | usage-based | $0.0005 per 1K tokens |
| SAP Business Technology Platform | SAP AI Core | usage-based | $0.02 per capability unit hour |
| Scale AI | Scale Generative AI Platform | usage-based | Free |
| Snorkel AI | Snorkel Flow Model Training | subscription | Free |
| Snowflake | Snowflake Cortex AI | usage-based | $0.04 per 1M tokens (Llama 3 8B) |
| Sora | Sora Video Generation API | subscription | $20 per month (ChatGPT Plus) |
| Stability AI | Stability AI Inference API | usage-based | $0.065 per image (SD3.5) |
| Stability AI API | Stability AI API - Stable Diffusion | usage-based | $0.003 per image (SDXL 512px) |
| StepFun | Step 3.5 Flash | pay-as-you-go | $0.1 per 1M input tokens |
| StepFun | Step 3.5 Flash | free | Price pending |
| Tabnine | Tabnine AI Code Inference | freemium | Free |
| Tencent | Hunyuan A13B Instruct | pay-as-you-go | $0.14 per 1M input tokens |
| Tencent Cloud AI | Tencent Cloud AI — NLP, Vision & LLM APIs | usage-based | $0.0008 per 1000 tokens (Hunyuan Lite) |
| Tenstorrent | Tenstorrent AI Inference Cloud | usage-based | $0.5 per hour (Wormhole card) |
| Together AI | DeepSeek V3.1 on Together AI | pay-as-you-go | $0.6 per 1M input tokens |
| Together AI | GLM-5 on Together AI | pay-as-you-go | $1 per 1M input tokens |
| Together AI | GPT-OSS 120B on Together AI | pay-as-you-go | $0.15 per 1M input tokens |
| Together AI | GPT-OSS 20B on Together AI | pay-as-you-go | $0.05 per 1M input tokens |
| Together AI | Kimi K2.5 on Together AI | pay-as-you-go | $0.5 per 1M input tokens |
| Together AI | Llama 4 Maverick on Together AI | pay-as-you-go | $0.27 per 1M input tokens |
| Together AI | Qwen3 Coder 480B on Together AI | pay-as-you-go | $2 per 1M input tokens |
| Together AI | Qwen3.5 397B on Together AI | pay-as-you-go | $0.6 per 1M input tokens |
| Together AI | Together AI — Open-Source Model Inference Platform | usage-based | $0.0001 per 1M tokens (Llama 3.2 8B) |
| TSMC | TSMC AI Chip Fabrication | custom | $20,000 per wafer (N5 process) |
| Unsloth | Unsloth AI Inference Optimization | freemium | Free |
| Unsloth | Unsloth Enterprise | subscription | Free |
| Upstage | Upstage Solar API — Enterprise LLM & Document AI | usage-based | $0.0001 per 1K tokens (Solar Mini) |
| Vercel | Vercel AI SDK & Inference | freemium | Free |
| vLLM | vLLM — High-Throughput LLM Inference Engine | open-source | Free |
| WhyLabs | WhyLabs LLM Monitoring | freemium | Free |
| Writer | Palmyra X5 | pay-as-you-go | $0.6 per 1M input tokens |
| Writer | Writer — Full-Stack Enterprise Generative AI | subscription | $18 per user per month |
| Writesonic | Writesonic — AI Writing & SEO Platform | freemium | Free |
| xAI | Grok (xAI) | usage-based | $0.0000002 per input token (grok-4-1-fast) |
| xAI | Grok 3 | pay-as-you-go | $3 per 1M input tokens |
| xAI | Grok 3 Beta | pay-as-you-go | $3 per 1M input tokens |
| xAI | Grok 3 Mini | pay-as-you-go | $0.3 per 1M input tokens |
| xAI | Grok 3 Mini Beta | pay-as-you-go | $0.3 per 1M input tokens |
| xAI | Grok 4 | pay-as-you-go | $3 per 1M input tokens |
| xAI | Grok 4 Fast | pay-as-you-go | $0.2 per 1M input tokens |
| xAI | Grok 4.1 Fast | pay-as-you-go | $0.2 per 1M input tokens |
| xAI | Grok 4.20 | pay-as-you-go | $2 per 1M input tokens |
| xAI | Grok 4.20 Beta | pay-as-you-go | $3 per 1M input tokens |
| xAI | Grok 4.20 Multi-Agent | pay-as-you-go | $2 per 1M input tokens |
| xAI | xAI API (Grok Models) | usage-based | $0.0002 per 1K input tokens (Grok-2-Mini) |

Showing 451–500 of 515 offerings
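
Several token-priced rows on this page quote different multiples (per token, per 1K tokens, per 1M tokens). The sketch below shows one way a reader might rescale those rows to a common unit before comparing them. The `usd_per_million_tokens` helper and the `TOKENS_PER_UNIT` table are hypothetical and not part of losclouds. Rows billed per second, per image, or per seat cannot be converted this way, and input-token prices are not strictly comparable with blended token prices even after rescaling.

```python
from decimal import Decimal

# Hypothetical lookup: how many tokens each unit label covers.
TOKENS_PER_UNIT = {
    "per input token": 1,
    "per 1K tokens": 1_000,
    "per 1000 tokens": 1_000,
    "per 1K input tokens": 1_000,
    "per 1M tokens": 1_000_000,
    "per 1M input tokens": 1_000_000,
}


def usd_per_million_tokens(price, unit):
    """Scale a token price to USD per 1M tokens; return None for non-token units."""
    tokens = TOKENS_PER_UNIT.get(unit)
    if tokens is None:
        return None
    # Decimal avoids binary floating-point rounding on small prices.
    return Decimal(price) * (1_000_000 // tokens)


# Rows taken from this page: (service, price in USD, unit label).
samples = [
    ("SambaNova Systems", "0.0005", "per 1K tokens"),
    ("xAI", "0.0000002", "per input token"),
    ("Snowflake", "0.04", "per 1M tokens"),
    ("RunPod", "0.00022", "per second"),
]

for service, price, unit in samples:
    print(service, usd_per_million_tokens(price, unit))
```

Under these assumptions the SambaNova row works out to $0.5 per 1M tokens and the xAI per-token row to $0.2 per 1M tokens, while the RunPod row stays unconverted because it is billed per second.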