APIs for running AI and machine learning model inference.
50 offerings on this page with service context, pricing, regions, and links.
| Service | Offering | Pricing model | Starting price | Regions | Features |
|---|---|---|---|---|---|
| Liquid AI | LFM2.5-1.2B-Thinking | Free | — | 0 | Context Window: 32768 tokens; Input Modalities: text |
| Llama API | Meta Llama API | Usage-based | $0.0002 per 1K tokens (Llama 3.1 8B) | 2 | Model Access: Llama 3.1 & 3.2; Multimodal: Llama 3.2 Vision; +3 more |
| Luma AI | Luma AI API (Dream Machine) | Usage-based | $0.140 per video generation (5-second clip) | 1 | Dream Machine API: Ray 2 model; Image-to-Video: Animate any image; +3 more |
| Meta AI | Meta AI & Llama Model Family | Open source | Free | 1 | Llama 3.1 405B: true; Multimodal (Llama 3.2): true; +3 more |
| Meta AI | Llama 3.1 405B | Pay-as-you-go | $3.50 per 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text |
| Meta AI | Llama 3.1 70B | Pay-as-you-go | $0.590 per 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text |
| Meta AI | Llama 3.1 70B Instruct | Pay-as-you-go | $0.400 per 1M input tokens | 0 | Context Window: 131072 tokens; Input Modalities: text |
| Meta AI | Llama 3.1 8B | Pay-as-you-go | $0.050 per 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text |
| Meta AI | Llama 3.1 8B Instruct | Pay-as-you-go | $0.020 per 1M input tokens | 0 | Context Window: 16384 tokens; Input Modalities: text |
| Meta AI | Llama 3.2 11B Vision | Pay-as-you-go | $0.180 per 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text, image |
| Meta AI | Llama 3.2 11B Vision Instruct | Pay-as-you-go | $0.049 per 1M input tokens | 0 | Context Window: 131072 tokens; Input Modalities: text, image |
| Meta AI | Llama 3.2 1B Instruct | Pay-as-you-go | $0.027 per 1M input tokens | 0 | Context Window: 60000 tokens; Input Modalities: text |
| Meta AI | Llama 3.2 3B Instruct | Pay-as-you-go | $0.051 per 1M input tokens | 0 | Context Window: 80000 tokens; Input Modalities: text |
| Meta AI | Llama 3.2 3B Instruct | Free | — | 0 | Context Window: 131072 tokens; Input Modalities: text |
| Meta AI | Llama 3.2 90B Vision | Pay-as-you-go | $0.880 per 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text, image |
| Meta AI | Llama 3.3 70B | Pay-as-you-go | $0.590 per 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text |
| Meta AI | Llama 3.3 70B Instruct | Pay-as-you-go | $0.100 per 1M input tokens | 0 | Context Window: 131072 tokens; Input Modalities: text |
| Meta AI | Llama 3.3 70B Instruct | Free | — | 0 | Context Window: 65536 tokens; Input Modalities: text |
| Meta AI | Llama 3 70B Instruct | Pay-as-you-go | $0.510 per 1M input tokens | 0 | Context Window: 8192 tokens; Input Modalities: text |
| Meta AI | Llama 3 8B Instruct | Pay-as-you-go | $0.030 per 1M input tokens | 0 | Context Window: 8192 tokens; Input Modalities: text |
| Meta AI | Llama 4 Maverick | Pay-as-you-go | $0.270 per 1M input tokens | 0 | Context Window: 1000000 tokens; Input Modalities: text, image |
| Meta AI | Llama 4 Scout | Pay-as-you-go | $0.110 per 1M input tokens | 0 | Context Window: 10000000 tokens; Input Modalities: text, image |
| Meta AI | Llama Guard 3 8B | Pay-as-you-go | $0.020 per 1M input tokens | 0 | Context Window: 131072 tokens; Input Modalities: text |
| Meta AI | Llama Guard 4 12B | Pay-as-you-go | $0.180 per 1M input tokens | 0 | Context Window: 163840 tokens; Input Modalities: image, text |
| Meta AI | Meta Llama Open-Weight Models | Open source | Free | 1 | Model Sizes: 1B, 3B, 8B, 70B, 405B; Multimodal (3.2): 11B, 90B vision; +3 more |
| Microsoft | Azure OpenAI Service | Usage-based | $0.002 per 1K input tokens (GPT-4o mini) | 6 | Model Coverage: GPT-4, GPT-4o, o1, Whisper, DALL-E; Data Residency: Regional deployment; +3 more |
| Microsoft | Phi-4 | Pay-as-you-go | $0.130 per 1M input tokens | 0 | Context Window: 16384 tokens; Input Modalities: text |
| Microsoft | Phi-4 Mini | Pay-as-you-go | $0.040 per 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text |
| Microsoft | WizardLM-2 8x22B | Pay-as-you-go | $0.620 per 1M input tokens | 0 | Context Window: 65535 tokens; Input Modalities: text |
| Microsoft Azure | Azure OpenAI Service | Usage-based | $0.002 per 1K tokens | 7 | Enterprise Security: VNet, Private Endpoint; Content Filtering: true; +3 more |
| Microsoft Azure | Azure Bot Service | Freemium | Free | 8 | SLA: 99.9%; Channels: 20+ built-in channels; +3 more |
| Microsoft Azure | Azure AI Services (Cognitive Services) | Usage-based | Free | 8 | SLA: 99.9%; APIs: Vision, Speech, Language, Decision; +3 more |
| Microsoft Bing | Microsoft Copilot (Bing AI) | Freemium | Free | 1 | Web Grounding: Real-time web search; Image Generation: DALL-E 3 integration; +2 more |
| Microsoft Copilot | Microsoft Copilot AI Assistant | Freemium | Free | 1 | GPT-4 Turbo Access: free (limited) / priority (Pro); Image Generation: DALL-E 3; +3 more |
| Milvus | Milvus RAG & AI Inference Backend | Freemium | Free | 4 | GPU Acceleration: GPU index build and search; Billion-Scale Search: Sub-millisecond at scale; +3 more |
| MiniMax | MiniMax-01 | Pay-as-you-go | $0.200 per 1M input tokens | 0 | Context Window: 1000192 tokens; Input Modalities: text, image |
| MiniMax | MiniMax M1 | Pay-as-you-go | $0.400 per 1M input tokens | 0 | Context Window: 1000000 tokens; Input Modalities: text |
| MiniMax | MiniMax M2 | Pay-as-you-go | $0.255 per 1M input tokens | 0 | Context Window: 196608 tokens; Input Modalities: text |
| MiniMax | MiniMax M2.1 | Pay-as-you-go | $0.270 per 1M input tokens | 0 | Context Window: 196608 tokens; Input Modalities: text |
| MiniMax | MiniMax M2.5 | Pay-as-you-go | $0.118 per 1M input tokens | 0 | Context Window: 196608 tokens; Input Modalities: text |
| MiniMax | MiniMax M2.5 | Free | — | 0 | Context Window: 196608 tokens; Input Modalities: text |
| MiniMax | MiniMax M2.7 | Pay-as-you-go | $0.300 per 1M input tokens | 0 | Context Window: 204800 tokens; Input Modalities: text |
| MiniMax | MiniMax M2-her | Pay-as-you-go | $0.300 per 1M input tokens | 0 | Context Window: 65536 tokens; Input Modalities: text |
| Mistral AI | Codestral | Pay-as-you-go | $0.300 per 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text |
| Mistral AI | Codestral 2508 | Pay-as-you-go | $0.300 per 1M input tokens | 0 | Context Window: 256000 tokens; Input Modalities: text |
| Mistral AI | Devstral 2 | Pay-as-you-go | $0.400 per 1M input tokens | 0 | Context Window: 256000 tokens; Input Modalities: text |
| Mistral AI | Devstral 2 2512 | Pay-as-you-go | $0.400 per 1M input tokens | 0 | Context Window: 262144 tokens; Input Modalities: text |
| Mistral AI | Devstral Medium | Pay-as-you-go | $0.400 per 1M input tokens | 0 | Context Window: 131072 tokens; Input Modalities: text |
| Mistral AI | Devstral Small 1.1 | Pay-as-you-go | $0.100 per 1M input tokens | 0 | Context Window: 131072 tokens; Input Modalities: text |
| Mistral AI | Ministral 3 14B 2512 | Pay-as-you-go | $0.200 per 1M input tokens | 0 | Context Window: 262144 tokens; Input Modalities: text, image |
Showing 201–250 of 515 offerings
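
Starting prices in the table are quoted in mixed units (some per 1K tokens, most per 1M input tokens), so it can help to normalize them before comparing rows. The following is a minimal sketch, assuming prices are copied by hand from the table; the helper names `per_million` and `input_cost` are hypothetical and not part of any provider's API, and real bills may also include output-token charges not listed here.

```python
def per_million(price: float, unit_tokens: int) -> float:
    """Convert a price quoted per `unit_tokens` tokens to a price per 1M tokens."""
    return price * 1_000_000 / unit_tokens


def input_cost(tokens: int, price_per_million: float) -> float:
    """Estimate the input-token cost in USD for a prompt of `tokens` tokens."""
    return tokens * price_per_million / 1_000_000


# Prices copied from the table (USD); both rows are illustrative picks.
llama_api_8b = per_million(0.0002, 1_000)      # Meta Llama API, quoted per 1K tokens
llama_31_8b = per_million(0.050, 1_000_000)    # Llama 3.1 8B, quoted per 1M input tokens

# Compare the two listings on the same per-1M basis.
print(f"Meta Llama API (Llama 3.1 8B): ${llama_api_8b:.3f} per 1M tokens")
print(f"Llama 3.1 8B:                  ${llama_31_8b:.3f} per 1M input tokens")

# Estimated input cost of a prompt that fills a 128000-token context window.
print(f"Full-context prompt: ${input_cost(128_000, llama_31_8b):.4f}")
```

Normalizing to a single unit first keeps the comparison honest: a per-1K price that looks small can be several times higher than a per-1M price once both are on the same scale.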