losclouds

AI Models / Compare

Qwen3 32B

Single-GPU Qwen3 with reasoning; competitive with much larger models.

Creator
Qwen
Lifecycle
Active
Context
131.1K
Max output
32.8K
Released
Apr 29, 2025
Status
unknown
Input
$0.14 / 1M tokens
Output
$0.56 / 1M tokens
Cached read
/ 1M tokens
Cached write
/ 1M tokens
Batch discount
%
Source
Qwen3 32B pricing
Verified
Apr 5, 2026 (High)

Capabilities

Modalities
texttext
Capabilities
reasoningbatchSupportpromptCachingfunctionCallingstructuredOutputs
Strengths
Single-GPU capable, Good reasoning for size, Apache 2.0
Tradeoffs
Text-only, lower ceiling than 235B
Official Links

Benchmark Coverage

BenchmarkVersionScoreDateSourceNotes

Release History

ReleaseAliasLifecycleRelease DateDeprecationShutdownSummary
Qwen3 32Bqwen3-32bActiveApr 29, 2025Current published model family snapshot.

Host Coverage

HostTypeContextPricing NoteDifferences
Alibaba DashScopefirst-party131.1K$0.14/$0.56 per MTok.
Migration Guidance

Best single-GPU open reasoning model in this class.

Replacement models: qwen3-235b-a22b

Change Events
DateTypeTitleDescriptionSource
Apr 29, 2025family_addedQwen3 32B publishedInitial public model family launch.Qwen3 32B release notes

Other models from Qwen

Qwen2.5 72B, Qwen3 235B-A22B, QwQ-32B