Self-hosted, feature-rich web interface for local LLMs via Ollama or OpenAI-compatible APIs. 30,000+ GitHub stars.
AI Inference service catalog
Otter AI Meeting Agent supports real-time transcription, live chat, automated summaries, insights, and action items.
AI search API providing real-time web search with LLM-generated answers for developer integration.
Premium tier of Perplexity AI with unlimited AI searches, file uploads, image generation, and API access.
Microsoft's family of small language models (SLMs) offering high performance for on-device and edge AI applications.
AI search engine and developer assistant specialized in coding questions, combining web search with code-optimized LLMs.
Conversational AI companion by Inflection AI. Designed for empathetic, supportive, and thoughtful conversations.
Multi-model AI chatbot platform by Quora. Access Claude, GPT-4, Gemini, Llama, and 300+ bots in one interface.
Democratize and productionize Gen AI across your entire org with Portkey's suite of AI gateway, observability, guardrails, and prompt management modules.
Alibaba Cloud's AI model series. Creator of the Qwen3 and QwQ reasoning families, available via DashScope API and open weights.
AI research company developing multimodal language models with strong visual understanding capabilities.
Cloud platform for running open-source AI models via API. Deploy Stable Diffusion, Llama, Whisper, and thousands more with one line of code.
Affordable GPU cloud for AI training and inference. Spot and on-demand GPU instances from $0.20/hr, with a global pod marketplace.
Discover Rytr, your free AI writing assistant. Craft high-quality content faster than ever before. Start for free and upgrade as you grow!
Salesforce is the #1 AI CRM, helping companies become Agentic Enterprises where humans and agents drive success together through a unified AI, data, and Customer 360 platform.
AI hardware and software company delivering full-stack AI platforms. RDU chip architecture optimized for generative AI workloads.
Pioneer of open-weight diffusion models for image, video, audio, and 3D generation. Creator of Stable Diffusion.
Developer API for Stable Diffusion image, video, and audio generation models from Stability AI.
Chinese AI company developing the Step series of multimodal large language models with strong reasoning capabilities.
AI code completion tool integrated into all major IDEs. Enterprise-grade with on-premise deployment and private model training.
Chinese technology conglomerate developing the Hunyuan large language model family for enterprise and consumer AI applications.
Tencent's AI and machine learning services including NLP, computer vision, and recommendation engines.
Build what's next on the AI Native Cloud. Full-stack AI platform for inference, fine-tuning, and GPU clusters — powered by cutting-edge research.