Offering
GPT-5 Chat
Offering detailsContext Window: 128000 tokens; Input Modalities: file, image, text
APIs for running AI and machine learning model inference.
50 offerings on this page with service context, pricing, regions, and links.
Offering
GPT-5 Chat
Offering detailsContext Window: 128000 tokens; Input Modalities: file, image, text
Offering
GPT-5 Codex
Offering detailsContext Window: 400000 tokens; Input Modalities: text, image
Offering
GPT-5 Image
Offering detailsContext Window: 400000 tokens; Input Modalities: image, text, file
Offering
GPT-5 Image Mini
Offering detailsContext Window: 400000 tokens; Input Modalities: file, image, text
Offering
GPT-5 Mini
Offering detailsContext Window: 400000 tokens; Input Modalities: text, image, file
Offering
GPT-5 Nano
Offering detailsContext Window: 400000 tokens; Input Modalities: text, image, file
Offering
GPT-5 Pro
Offering detailsContext Window: 400000 tokens; Input Modalities: image, text, file
Offering
GPT Audio
Offering detailsContext Window: 128000 tokens; Input Modalities: text, audio
Offering
GPT Audio Mini
Offering detailsContext Window: 128000 tokens; Input Modalities: text, audio
Offering
gpt-oss-120b
Offering detailsContext Window: 131072 tokens; Input Modalities: text
Offering
gpt-oss-120b
Offering detailsContext Window: 131072 tokens; Input Modalities: text
Offering
gpt-oss-20b
Offering detailsContext Window: 131072 tokens; Input Modalities: text
Offering
gpt-oss-20b
Offering detailsContext Window: 131072 tokens; Input Modalities: text
Offering
gpt-oss-safeguard-20b
Offering detailsContext Window: 131072 tokens; Input Modalities: text
Offering
o1
Offering detailsContext Window: 200000 tokens; Input Modalities: text, image
Offering
o1-pro
Offering detailsContext Window: 200000 tokens; Input Modalities: text, image, file
Offering
o3
Offering detailsContext Window: 200000 tokens; Input Modalities: text, image
Offering
o3 Deep Research
Offering detailsContext Window: 200000 tokens; Input Modalities: image, text, file
Offering
o3-mini
Offering detailsContext Window: 200000 tokens; Input Modalities: text
Offering
o3 Mini High
Offering detailsContext Window: 200000 tokens; Input Modalities: text, file
Offering
o3 Pro
Offering detailsContext Window: 200000 tokens; Input Modalities: text, file, image
Offering
o4-mini
Offering detailsContext Window: 200000 tokens; Input Modalities: text, image
Offering
o4 Mini Deep Research
Offering detailsContext Window: 200000 tokens; Input Modalities: file, image, text
Offering
o4 Mini High
Offering detailsContext Window: 200000 tokens; Input Modalities: image, text, file
Offering
OpenAI GPT-5 Family
Offering detailsChain-of-thought: Internal reasoning; Math/Science: PhD-level; +3 more
Offering
OpenAI Whisper (Speech-to-Text)
Offering detailsLanguages: 99 languages; Translation: Any → English; +3 more
Offering
Perplexity AI Search
Offering detailsReal-time Search: true; Citations: true; +2 more
Offering
Perplexity Pro
Offering detailsCore Feature: Included; Support: Standard; +1 more
Offering
Sonar
Offering detailsContext Window: 128000 tokens; Input Modalities: text, image
Offering
Sonar Deep Research
Offering detailsContext Window: 128000 tokens; Input Modalities: text
Offering
Sonar Pro
Offering detailsContext Window: 200000 tokens; Input Modalities: text, image
Offering
Sonar Pro Search
Offering detailsContext Window: 200000 tokens; Input Modalities: text, image
Offering
Sonar Reasoning Pro
Offering detailsContext Window: 128000 tokens; Input Modalities: text
Offering
Perplexity API
Offering detailsReal-time Web Search: true; Citations: true; +2 more
Offering
Perplexity Pro
Offering detailsModel Selection: GPT-4o, Claude, Gemini; File Upload: PDF, docs, images; +2 more
Offering
Microsoft Phi Small Language Models
Offering detailsOn-Device Capable: true; Long Context: 128K tokens; +2 more
Offering
Phind AI Developer Search
Offering detailsCode-Optimized Search: true; VS Code Extension: true; +2 more
Offering
Pi AI Conversational Companion
Offering detailsEmpathetic Conversation: true; Voice Mode: 5 voices; +2 more
Offering
Pieces AI Long-Term Memory
Offering detailsLong-Term Memory: Persistent AI context; Local AI Models: Offline inference; +2 more
Offering
Pinecone Inference API
Offering detailsEmbedding Models: multilingual-e5-large; Reranking: bge-reranker-v2-m3; +2 more
Offering
Poe Multi-Model AI Platform
Offering details300+ AI Models: 300+; Custom Bot Creation: true; +2 more
Offering
Prismic AI Content Generation
Offering detailsAI Writing Assist: In-editor generation; AI Translation: Multi-locale; +2 more
Offering
Qualcomm AI Inference (Cloud AI)
Offering detailsPerformance: 400-2000+ TOPS; Power Efficiency: Industry-leading perf/watt; +3 more
Offering
Poe by Quora
Offering detailsMulti-model Access: GPT-4, Claude, Gemini+; Custom Bots: System prompt builder; +3 more
Offering
Qwen LLM API (Alibaba Cloud)
Offering detailsOpen Weights: true; Code Generation: Qwen-Coder; +2 more
Offering
Qwen2.5 72B Instruct
Offering detailsContext Window: 32768 tokens; Input Modalities: text
Offering
Qwen2.5 7B Instruct
Offering detailsContext Window: 32768 tokens; Input Modalities: text
Offering
Qwen2.5 Coder 32B Instruct
Offering detailsContext Window: 32768 tokens; Input Modalities: text
Offering
Qwen-Max
Offering detailsContext Window: 32768 tokens; Input Modalities: text
Offering
Qwen-Plus
Offering detailsContext Window: 1000000 tokens; Input Modalities: text
| Service | Offering | Pricing model | Starting price | Regions | Features | Links |
|---|---|---|---|---|---|---|
OpenAIService details | GPT-5 Chat Offering details | Pay-as-you-go | $1.25 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: file, image, text | |
OpenAIService details | GPT-5 Codex Offering details | Pay-as-you-go | $1.25 1M input tokens | 0 | Context Window: 400000 tokens; Input Modalities: text, image | |
OpenAIService details | GPT-5 Image Offering details | Pay-as-you-go | $10 1M input tokens | 0 | Context Window: 400000 tokens; Input Modalities: image, text, file | |
OpenAIService details | GPT-5 Image Mini Offering details | Pay-as-you-go | $2.50 1M input tokens | 0 | Context Window: 400000 tokens; Input Modalities: file, image, text | |
OpenAIService details | GPT-5 Mini Offering details | Pay-as-you-go | $0.250 1M input tokens | 0 | Context Window: 400000 tokens; Input Modalities: text, image, file | |
OpenAIService details | GPT-5 Nano Offering details | Pay-as-you-go | $0.050 1M input tokens | 0 | Context Window: 400000 tokens; Input Modalities: text, image, file | |
OpenAIService details | GPT-5 Pro Offering details | Pay-as-you-go | $15 1M input tokens | 0 | Context Window: 400000 tokens; Input Modalities: image, text, file | |
OpenAIService details | GPT Audio Offering details | Pay-as-you-go | $2.50 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text, audio | |
OpenAIService details | GPT Audio Mini Offering details | Pay-as-you-go | $0.600 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text, audio | |
OpenAIService details | gpt-oss-120b Offering details | Pay-as-you-go | $0.039 1M input tokens | 0 | Context Window: 131072 tokens; Input Modalities: text | |
OpenAIService details | gpt-oss-120b Offering details | Free | — | 0 | Context Window: 131072 tokens; Input Modalities: text | |
OpenAIService details | gpt-oss-20b Offering details | Pay-as-you-go | $0.030 1M input tokens | 0 | Context Window: 131072 tokens; Input Modalities: text | |
OpenAIService details | gpt-oss-20b Offering details | Free | — | 0 | Context Window: 131072 tokens; Input Modalities: text | |
OpenAIService details | gpt-oss-safeguard-20b Offering details | Pay-as-you-go | $0.075 1M input tokens | 0 | Context Window: 131072 tokens; Input Modalities: text | |
OpenAIService details | o1 Offering details | Pay-as-you-go | $15 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: text, image | |
OpenAIService details | o1-pro Offering details | Pay-as-you-go | $150 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: text, image, file | |
OpenAIService details | o3 Offering details | Pay-as-you-go | $2 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: text, image | |
OpenAIService details | o3 Deep Research Offering details | Pay-as-you-go | $10 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: image, text, file | |
OpenAIService details | o3-mini Offering details | Pay-as-you-go | $1.10 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: text | |
OpenAIService details | o3 Mini High Offering details | Pay-as-you-go | $1.10 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: text, file | |
OpenAIService details | o3 Pro Offering details | Pay-as-you-go | $20 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: text, file, image | |
OpenAIService details | o4-mini Offering details | Pay-as-you-go | $1.10 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: text, image | |
OpenAIService details | o4 Mini Deep Research Offering details | Pay-as-you-go | $2 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: file, image, text | |
OpenAIService details | o4 Mini High Offering details | Pay-as-you-go | $1.10 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: image, text, file | |
OpenAIService details | OpenAI GPT-5 Family Offering details | Usage-based | $0.000 per input token (GPT-5) | 1 | Chain-of-thought: Internal reasoning; Math/Science: PhD-level; +3 more | |
OpenAIService details | OpenAI Whisper (Speech-to-Text) Offering details | Usage-based | $0.006 per minute of audio | 1 | Languages: 99 languages; Translation: Any → English; +3 more | |
PerplexityService details | Perplexity AI Search Offering details | Freemium | Free | 1 | Real-time Search: true; Citations: true; +2 more | |
PerplexityService details | Perplexity Pro Offering details | Subscription | $20 per month | 1 | Core Feature: Included; Support: Standard; +1 more | |
PerplexityService details | Sonar Offering details | Pay-as-you-go | $1 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text, image | |
PerplexityService details | Sonar Deep Research Offering details | Pay-as-you-go | $2 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text | |
PerplexityService details | Sonar Pro Offering details | Pay-as-you-go | $3 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: text, image | |
PerplexityService details | Sonar Pro Search Offering details | Pay-as-you-go | $3 1M input tokens | 0 | Context Window: 200000 tokens; Input Modalities: text, image | |
PerplexityService details | Sonar Reasoning Pro Offering details | Pay-as-you-go | $2 1M input tokens | 0 | Context Window: 128000 tokens; Input Modalities: text | |
Perplexity APIService details | Perplexity API Offering details | Usage-based | $0.200 per 1M tokens (Sonar Small) | 2 | Real-time Web Search: true; Citations: true; +2 more | |
Perplexity ProService details | Perplexity Pro Offering details | Subscription | $20 per month | 1 | Model Selection: GPT-4o, Claude, Gemini; File Upload: PDF, docs, images; +2 more | |
Phi (Microsoft)Service details | Microsoft Phi Small Language Models Offering details | Usage-based | $0.0001 per 1K tokens (Phi-3.5 Mini via Azure) | 5 | On-Device Capable: true; Long Context: 128K tokens; +2 more | |
PhindService details | Phind AI Developer Search Offering details | Freemium | Free | 1 | Code-Optimized Search: true; VS Code Extension: true; +2 more | |
PiService details | Pi AI Conversational Companion Offering details | Free | Free | 1 | Empathetic Conversation: true; Voice Mode: 5 voices; +2 more | |
Pieces for DevelopersService details | Pieces AI Long-Term Memory Offering details | Freemium | Free | 1 | Long-Term Memory: Persistent AI context; Local AI Models: Offline inference; +2 more | |
PineconeService details | Pinecone Inference API Offering details | Usage-based | $0.000 per 1K tokens | 3 | Embedding Models: multilingual-e5-large; Reranking: bge-reranker-v2-m3; +2 more | |
PoeService details | Poe Multi-Model AI Platform Offering details | Freemium | Free | 1 | 300+ AI Models: 300+; Custom Bot Creation: true; +2 more | |
PrismicService details | Prismic AI Content Generation Offering details | Subscription | $9 per user per month | 1 | AI Writing Assist: In-editor generation; AI Translation: Multi-locale; +2 more | |
QualcommService details | Qualcomm AI Inference (Cloud AI) Offering details | Hardware-based | Free | 1 | Performance: 400-2000+ TOPS; Power Efficiency: Industry-leading perf/watt; +3 more | |
QuoraService details | Poe by Quora Offering details | Freemium | Free | 1 | Multi-model Access: GPT-4, Claude, Gemini+; Custom Bots: System prompt builder; +3 more | |
QwenService details | Qwen LLM API (Alibaba Cloud) Offering details | Usage-based | $0.0004 per 1K tokens (Qwen-Turbo) | 3 | Open Weights: true; Code Generation: Qwen-Coder; +2 more | |
QwenService details | Qwen2.5 72B Instruct Offering details | Pay-as-you-go | $0.120 1M input tokens | 0 | Context Window: 32768 tokens; Input Modalities: text | |
QwenService details | Qwen2.5 7B Instruct Offering details | Pay-as-you-go | $0.040 1M input tokens | 0 | Context Window: 32768 tokens; Input Modalities: text | |
QwenService details | Qwen2.5 Coder 32B Instruct Offering details | Pay-as-you-go | $0.660 1M input tokens | 0 | Context Window: 32768 tokens; Input Modalities: text | |
QwenService details | Qwen-Max Offering details | Pay-as-you-go | $1.04 1M input tokens | 0 | Context Window: 32768 tokens; Input Modalities: text | |
QwenService details | Qwen-Plus Offering details | Pay-as-you-go | $0.260 1M input tokens | 0 | Context Window: 1000000 tokens; Input Modalities: text |
Showing 351–400 of 515 offerings