losclouds
Compare · AI Inference

What's on offer.

APIs for running AI and machine learning model inference.

Offerings

50 offerings on this page with service context, pricing, regions, and links.

DeepSeek
DeepSeekService details

Offering

DeepSeek R1

Offering details
Pay-as-you-go$0.140 per 1M input tokens (cache hit)1 region

Model Alias: deepseek-reasoner; Context Window: 64K tokens; +3 more

DeepSeek
DeepSeekService details

Offering

R1 0528

Offering details
Pay-as-you-go$0.450 1M input tokens0 regions

Context Window: 163840 tokens; Input Modalities: text

DeepSeek
DeepSeekService details

Offering

R1 Distill Llama 70B

Offering details
Pay-as-you-go$0.700 1M input tokens0 regions

Context Window: 131072 tokens; Input Modalities: text

DeepSeek
DeepSeekService details

Offering

R1 Distill Qwen 32B

Offering details
Pay-as-you-go$0.290 1M input tokens0 regions

Context Window: 32768 tokens; Input Modalities: text

DeepSeek
DeepSeekService details

Offering

DeepSeek Reasoner

Offering details
Pay-as-you-go$0.550 1M input tokens0 regions

Context Window: 128000 tokens; Input Modalities: text

DeepSeek
DeepSeekService details

Offering

DeepSeek V3

Offering details
Usage-based$0.000 per 1K input tokens (cache hit)1 region

Context Window: 64K tokens; Open Source: true; +3 more

DeepSeek
DeepSeekService details

Offering

DeepSeek V3.1 Terminus

Offering details
Pay-as-you-go$0.210 1M input tokens0 regions

Context Window: 163840 tokens; Input Modalities: text

DeepSeek
DeepSeekService details

Offering

DeepSeek V3.2

Offering details
Pay-as-you-go$0.260 1M input tokens0 regions

Context Window: 163840 tokens; Input Modalities: text

DeepSeek
DeepSeekService details

Offering

DeepSeek V3.2 Exp

Offering details
Pay-as-you-go$0.270 1M input tokens0 regions

Context Window: 163840 tokens; Input Modalities: text

DeepSeek
DeepSeekService details

Offering

DeepSeek V3.2 Speciale

Offering details
Pay-as-you-go$0.400 1M input tokens0 regions

Context Window: 163840 tokens; Input Modalities: text

EdgeDB
EdgeDBService details

Offering

EdgeDB AI (ext::ai)

Offering details
Usage-basedFree2 regions

Embedding Providers: OpenAI, Mistral, Anthropic; Vector Search: pgvector; +2 more

ElevenLabs
ElevenLabsService details

Offering

ElevenLabs Voice API

Offering details
Usage-basedFree1 region

Realism: Industry-leading TTS; Voice Cloning: 1-minute sample; +3 more

Etched
EtchedService details

Offering

Etched Sohu AI Inference Service

Offering details
Usage-basedFree1 region

Throughput: 500K tokens/sec; Chip Architecture: Transformer ASIC; +2 more

fal.ai
fal.aiService details

Offering

fal.ai Serverless AI Inference

Offering details
Usage-based$0.0003 per second of compute3 regions

Flux Model Support: true; Cold Start Time: <500ms; +3 more

Fireworks AI
Fireworks AIService details

Offering

Fireworks AI Fast Inference API

Offering details
Usage-based$0.0002 per 1K tokens3 regions

OpenAI-Compatible API: true; Model Selection: 50+ models; +3 more

Gemma
GemmaService details

Offering

Gemma Open-Weight Language Models

Offering details
Open sourceFree4 regions

Model Sizes: 2B, 7B, 9B, 27B; Open Weights: true; +3 more

Gemma
GemmaService details

Offering

Gemma Open Models

Offering details
FreeFree1 region

License: Gemma Terms of Use (commercial allowed); Model Variants: Gemma, CodeGemma, PaliGemma, RecurrentGemma; +3 more

GitBook
GitBookService details

Offering

GitBook AI

Offering details
SubscriptionFree1 region

Answer Quality: Cited, doc-grounded responses; Setup: Zero configuration; +2 more

Google Cloud Platform
Google Cloud PlatformService details

Offering

Vertex AI — Google Cloud ML Platform

Offering details
Usage-based$0.0001 per 1K characters input (Gemini 1.5 Flash)15 regions

Gemini API Access: Gemini 1.5 Flash & Pro; Vertex AI Studio: No-code prompt experimentation; +3 more

Google Cloud Platform
Google Cloud PlatformService details

Offering

Google Cloud Natural Language API

Offering details
Usage-basedFree8 regions

Free Tier: 5,000 units/month; Analysis Types: Sentiment, Entity, Syntax, Classification; +3 more

Google Gemini
Google GeminiService details

Offering

Gemini 1.5 Flash

Offering details
Pay-as-you-go$0.075 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, image, audio, video

Google Gemini
Google GeminiService details

Offering

Gemini 1.5 Pro

Offering details
Pay-as-you-go$1.25 1M input tokens0 regions

Context Window: 2097152 tokens; Input Modalities: text, image, audio, video

Google Gemini
Google GeminiService details

Offering

Gemini 2.0 Flash

Offering details
Pay-as-you-go$0.100 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, image, audio, video

Google Gemini
Google GeminiService details

Offering

Gemini 2.0 Flash

Offering details
Pay-as-you-go$0.100 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, image, file, audio, video

Google Gemini
Google GeminiService details

Offering

Gemini 2.0 Flash Lite

Offering details
Pay-as-you-go$0.075 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, image, file, audio, video

Google Gemini
Google GeminiService details

Offering

Gemini 2.5 Flash

Offering details
Pay-as-you-go$0.300 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, image, video, audio

Google Gemini
Google GeminiService details

Offering

Nano Banana (Gemini 2.5 Flash Image)

Offering details
Pay-as-you-go$0.300 1M input tokens0 regions

Context Window: 32768 tokens; Input Modalities: image, text

Google Gemini
Google GeminiService details

Offering

Gemini 2.5 Flash-Lite

Offering details
Pay-as-you-go$0.100 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, image, video, audio, pdf

Google Gemini
Google GeminiService details

Offering

Gemini 2.5 Flash Lite Preview 09-2025

Offering details
Pay-as-you-go$0.100 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, image, file, audio, video

Google Gemini
Google GeminiService details

Offering

Gemini 2.5 Pro

Offering details
Pay-as-you-go$1.25 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, image, audio, video

Google Gemini
Google GeminiService details

Offering

Gemini 2.5 Pro Preview 06-05

Offering details
Pay-as-you-go$1.25 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: file, image, text, audio

Google Gemini
Google GeminiService details

Offering

Gemini 2.5 Pro Preview 05-06

Offering details
Pay-as-you-go$1.25 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, image, file, audio, video

Google Gemini
Google GeminiService details

Offering

Nano Banana 2 (Gemini 3.1 Flash Image Preview)

Offering details
Pay-as-you-go$0.500 1M input tokens0 regions

Context Window: 65536 tokens; Input Modalities: image, text

Google Gemini
Google GeminiService details

Offering

Gemini 3.1 Flash Lite Preview

Offering details
Pay-as-you-go$0.250 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, image, video, file, audio

Google Gemini
Google GeminiService details

Offering

Gemini 3.1 Pro Preview

Offering details
Pay-as-you-go$2 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: audio, file, image, text, video

Google Gemini
Google GeminiService details

Offering

Gemini 3.1 Pro Preview Custom Tools

Offering details
Pay-as-you-go$2 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, audio, image, video, file

Google Gemini
Google GeminiService details

Offering

Gemini 3 Flash Preview

Offering details
Pay-as-you-go$0.500 1M input tokens0 regions

Context Window: 1048576 tokens; Input Modalities: text, image, file, audio, video

Google Gemini
Google GeminiService details

Offering

Nano Banana Pro (Gemini 3 Pro Image Preview)

Offering details
Pay-as-you-go$2 1M input tokens0 regions

Context Window: 65536 tokens; Input Modalities: image, text

Google Gemini
Google GeminiService details

Offering

Google Gemini Advanced

Offering details
Subscription$19.99 per month1 region

Context Window: 1M tokens; Gemini in Gmail: true; +3 more

Google Gemini
Google GeminiService details

Offering

Gemini Flash

Offering details
Usage-based$0.0003 per 1K input tokens1 region

Context Window: 1M tokens; Multimodal: Text, Image, Video, Audio; +3 more

Google Gemini
Google GeminiService details

Offering

Gemini Pro

Offering details
Usage-based$0.0013 per 1K input tokens1 region

Context Window: 2M tokens; Multimodal: Text, Image, Video, Audio; +3 more

Google Gemini
Google GeminiService details

Offering

Gemma 3 12B

Offering details
Pay-as-you-go$0.040 1M input tokens0 regions

Context Window: 131072 tokens; Input Modalities: text, image

Google Gemini
Google GeminiService details

Offering

Gemma 3 27B

Offering details
Pay-as-you-go$0.080 1M input tokens0 regions

Context Window: 131072 tokens; Input Modalities: text, image

Showing 101–150 of 515 offerings