Pricing reference rows.
Review provider-published pricing references for AI Inference. Rows stay unsorted by price when billing units do not match.
Mixed units
Starting prices are shown as references, not a cheapest ranking.
Providers publish these rows in different billing units, so losclouds keeps the original labels and falls back to alphabetical order.
Pricing comparison
50 offerings on this page, listed alphabetically because starting-price units are mixed.
Service
Google Workspace
Offering
Gemini for Google Workspace
Service
Grammarly
Offering
Grammarly Business
Service
Grammarly
Offering
Grammarly GO (AI Writing Features)
Service
IBM watsonx
Offering
IBM watsonx.ai Foundation Models
Service
Intel Gaudi (Habana Labs)
Offering
Intel Gaudi AI Inference
Service
Kling AI
Offering
Kling AI - Video & Image Generation API
Service
Lepton AI
Offering
Lepton AI Inference & Deployment Platform
| Service | Offering | Model | Starting Price↕ | Details |
|---|---|---|---|---|
| Google Gemini | Gemma 3 4B | pay-as-you-go | $0.041M input tokens | Pricing |
| Google Gemini | Gemma 3 4B | free | — | Pricing |
| Google Gemini | Gemma 3n 2B | free | — | Pricing |
| Google Gemini | Gemma 3n 4B | pay-as-you-go | $0.021M input tokens | Pricing |
| Google Gemini | Gemma 3n 4B | free | — | Pricing |
| Google Gemini | Gemma 4 26B A4B | pay-as-you-go | $0.131M input tokens | Pricing |
| Google Gemini | Gemma 4 31B | pay-as-you-go | $0.141M input tokens | Pricing |
| Google Gemini | Lyria 3 Clip Preview | free | — | Pricing |
| Google Gemini | Lyria 3 Pro Preview | free | — | Pricing |
| Google Workspace | Gemini for Google Workspace | subscription | $20per user/month (Gemini Business add-on) | Pricing |
| Grammarly | Grammarly AI Writing Assistant | freemium | Free | Pricing |
| Grammarly | Grammarly Business | subscription | $15per member/month (billed annually, minimum 3 seats) | Pricing |
| Grammarly | Grammarly GO (AI Writing Features) | subscription | $12per month (Pro plan, billed annually) | Pricing |
| Graphcore | Graphcore Poplar SDK | free | Free | Pricing |
| Groq | GPT-OSS 120B on Groq | pay-as-you-go | $0.151M input tokens | Pricing |
| Groq | GPT-OSS 20B on Groq | pay-as-you-go | $0.0751M input tokens | Pricing |
| Groq | Groq Compound | custom | — | Pricing |
| Groq | Groq LLaMA Inference | usage-based | $0.00005per 1K input tokens (LLaMA 3 8B) | Pricing |
| Groq | Groq LPU AI Inference API | usage-based | Free | Pricing |
| Groq | Groq Mixtral Inference | usage-based | $0.00024per 1K input tokens | Pricing |
| Groq | Llama 3.1 8B Instant on Groq | pay-as-you-go | $0.051M input tokens | Pricing |
| Groq | Llama 3.3 70B Versatile on Groq | pay-as-you-go | $0.591M input tokens | Pricing |
| Groq | Llama 4 Scout on Groq | pay-as-you-go | $0.111M input tokens | Pricing |
| Groq | Qwen3 32B on Groq | pay-as-you-go | $0.291M input tokens | Pricing |
| H2O.ai | H2O.ai Model Deployment (Driverless AI + MLOps) | subscription | Free | Pricing |
| Hailuo AI | Hailuo AI MiniMax API | usage-based | $0.0002per 1K input tokens | Pricing |
| Helicone | Helicone - AI Gateway and Observability | freemium | Free | Pricing |
| Hugging Face | Hugging Face Inference Endpoints | usage-based | $0.032per hour (CPU) | Pricing |
| Humanloop | Humanloop LLM Development Platform | freemium | Free | Pricing |
| HyperWrite | HyperWrite AI Models | subscription | $19.99per month (Premium) | Pricing |
| IBM Cloud | IBM Cloud - Watson Machine Learning | usage-based | Free | Pricing |
| IBM Research | Granite 4.0 Micro | pay-as-you-go | $0.0171M input tokens | Pricing |
| IBM watsonx | IBM watsonx.ai Foundation Models | usage-based | $0.0001per 1K tokens (Granite 3B) | Pricing |
| Inception Labs | Mercury | pay-as-you-go | $0.251M input tokens | Pricing |
| Inception Labs | Mercury 2 | pay-as-you-go | $0.251M input tokens | Pricing |
| Inception Labs | Mercury Coder | pay-as-you-go | $0.251M input tokens | Pricing |
| Insilico Medicine | Insilico Medicine PandaOmics & Chemistry42 | enterprise | Free | Pricing |
| Intel Gaudi (Habana Labs) | Intel Gaudi AI Inference | usage-based | $13.11per hour (AWS DL1 instance, dl1.24xlarge) | Pricing |
| InvokeAI | InvokeAI - Generative AI API | open-source | Free | Pricing |
| Jasper | Jasper AI Marketing Content Platform | subscription | $39per month | Pricing |
| Kagi | Kagi - AI Search Summarization | subscription | $5per month | Pricing |
| Khan Academy | Khanmigo - AI Tutoring Assistant | subscription | $4per month | Pricing |
| Kling AI | Kling AI - Video & Image Generation API | usage-based | $0.14per video generation | Pricing |
| Lambda | Lambda - GPU Cloud AI Inference | usage-based | $0.5per hour (A10 GPU) | Pricing |
| LanceDB | LanceDB RAG & AI Application Backend | freemium | Free | Pricing |
| LangChain | LangChain - AI Inference and LLM Integrations | open-source | Free | Pricing |
| Lepton AI | Lepton AI Inference & Deployment Platform | usage-based | $0.0003per 1K tokens (Llama 3 8B) | Pricing |
| Lightning AI | Lightning AI Inference & Deployment | usage-based | Free | Pricing |
| Liquid AI | LFM2-24B-A2B | pay-as-you-go | $0.031M input tokens | Pricing |
| Liquid AI | LFM2.5-1.2B-Instruct | free | — | Pricing |
Showing 151–200 of 515 offerings