GPT-OSS 120B on Groq
High-speed Groq-hosted GPT-OSS tier with reasoning, prompt caching, and tool support.
- Creator: Groq
- Lifecycle: Active
- Context: 131.1K
- Max output: 65.5K
- Released: Aug 1, 2025
- Status: up
- Input: $0.15 / 1M tokens
- Output: $0.60 / 1M tokens
- Cached read: $0.07 / 1M tokens
- Cached write: $0.15 / 1M tokens
- Batch discount: —%
- Source: GPT-OSS 120B on Groq pricing
- Verified: Apr 2, 2026 (High)
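The per-token prices listed above can be turned into a rough per-request estimate. The sketch below is illustrative only: the function name `estimate_cost` and its parameters are hypothetical, and it assumes the four token categories are metered separately and billed additively, which the listing does not confirm. No batch discount is applied because none is listed.

```python
# Prices in USD per 1M tokens, copied from the listing above.
PRICE_PER_MILLION = {
    "input": 0.15,
    "output": 0.60,
    "cached_read": 0.07,
    "cached_write": 0.15,
}


def estimate_cost(input_tokens=0, output_tokens=0,
                  cached_read_tokens=0, cached_write_tokens=0):
    """Estimate the USD cost of one request.

    Assumption: each token category is counted separately, so
    `input_tokens` means uncached input tokens only.
    """
    usage = {
        "input": input_tokens,
        "output": output_tokens,
        "cached_read": cached_read_tokens,
        "cached_write": cached_write_tokens,
    }
    return sum(PRICE_PER_MILLION[kind] * count / 1_000_000
               for kind, count in usage.items())


if __name__ == "__main__":
    # 10,000 uncached input tokens and 2,000 output tokens.
    print(f"{estimate_cost(input_tokens=10_000, output_tokens=2_000):.4f}")  # prints 0.0027
```

Under these assumptions, reading a prompt from cache costs a little under half the uncached input price ($0.07 versus $0.15 per 1M tokens), so the saving grows with the share of the prompt that is reused across requests.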
Capabilities
- Modalities: text→text
- Capabilities: reasoning, webSearch, batchSupport, codeExecution, promptCaching, functionCalling, structuredOutputs
- Strengths: Fast hosted reasoning, cheap compared with frontier APIs
- Tradeoffs: Provider page is host-oriented, not creator-oriented
Migration Guidance
Hosted reasoning option for teams prioritizing Groq latency.