QwQ-32B
Open-weight reasoning specialist; best-in-class 32B for math, coding, and STEM.
- Creator: Qwen
- Lifecycle: Active
- Context: 131.1K
- Max output: 32.8K
- Released: Mar 5, 2025
- Status: unknown
- Input: $0.12 / 1M tokens
- Output: $0.15 / 1M tokens
- Cached read: — / 1M tokens
- Cached write: — / 1M tokens
- Batch discount: —%
- Source: QwQ-32B pricing
- Verified: Apr 5, 2026 (High)
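The per-token prices above translate into a request cost with simple arithmetic. The following is a minimal sketch, not an official calculator: the function name `estimate_cost_usd` is hypothetical, the two rates are copied from the listing, and it assumes that thinking tokens are billed as ordinary output tokens (the listing does not say so, and providers may differ). Cached and batch pricing are left out because the listing gives no values for them.

```python
# Prices copied from the listing above (USD per 1M tokens).
INPUT_USD_PER_MTOK = 0.12
OUTPUT_USD_PER_MTOK = 0.15


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD.

    Assumption: output_tokens includes any thinking tokens, billed at
    the normal output rate. Check your provider's billing rules.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


if __name__ == "__main__":
    # 10,000 prompt tokens and 2,000 generated tokens
    print(f"${estimate_cost_usd(10_000, 2_000):.4f}")  # prints $0.0015
```

Because reasoning models tend to emit long thinking traces, the output term will often dominate the estimate even though the two rates are close.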
Capabilities
- Modalities: text→text
- Capabilities: reasoning, batchSupport, promptCaching, functionCalling, structuredOutputs
- Strengths: o1-competitive math/reasoning at 32B, Apache 2.0, very cheap via Groq
- Tradeoffs: thinking tokens add latency, text-only
Migration Guidance
The cheapest high-quality reasoning option. For maximum quality, use Qwen3-235B instead.
Replacement models: qwen3-235b-a22b