Qwen3 32B
Single-GPU Qwen3 with reasoning; competitive with much larger models.
- Creator: Qwen
- Lifecycle: Active
- Context: 131.1K
- Max output: 32.8K
- Released: Apr 29, 2025
- Status: unknown
- Input: $0.14 / 1M tokens
- Output: $0.56 / 1M tokens
- Cached read: not listed
- Cached write: not listed
- Batch discount: not listed
- Source: Qwen3 32B pricing
- Verified: Apr 5, 2026 (High)
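The listed input and output prices can be turned into a rough per-request cost. The snippet below is a minimal sketch of that arithmetic, not an official calculator: the function name `estimate_cost_usd` and the constant names are hypothetical, and it ignores cached reads, cached writes, and batch discounts because the listing gives no values for them.

```python
# Prices copied from the listing, in USD per 1M tokens.
INPUT_USD_PER_MTOK = 0.14
OUTPUT_USD_PER_MTOK = 0.56


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the listed prices.

    Hypothetical helper: assumes flat per-token pricing with no
    caching or batch discount applied.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MTOK
             + output_tokens * OUTPUT_USD_PER_MTOK)
    return total / 1_000_000


if __name__ == "__main__":
    # A 10,000-token prompt with a 2,000-token reply.
    print(f"{estimate_cost_usd(10_000, 2_000):.5f}")  # prints 0.00252
```

At these prices an output token costs four times as much as an input token, so long reasoning traces are likely to dominate the bill for short prompts.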
Capabilities
- Modalities: text→text
- Capabilities: reasoning, batchSupport, promptCaching, functionCalling, structuredOutputs
- Strengths: single-GPU capable, good reasoning for size, Apache 2.0
- Tradeoffs: text-only, lower ceiling than 235B
Migration Guidance
Best single-GPU open reasoning model in this class.
Replacement models: qwen3-235b-a22b