losclouds
Model · Qwen

Qwen3 32B

Single-GPU Qwen3 with reasoning; competitive with much larger models.

Pricing

Model Overview

Tracked token-pricing fields for this model family. Empty pricing fields stay hidden until the source publishes them.

Model pricing
Price fieldValue
Input$0.14 / 1M tokens
Output$0.56 / 1M tokens
SourceQwen3 32B pricing
VerifiedApr 5, 2026 (High)

Surface

Capabilities

Input and output modalities, enabled feature flags, strengths, and tradeoffs.

Model capabilities
AttributeValues
Modalities
text
to
text
Capabilities
reasoningbatchSupportpromptCachingfunctionCallingstructuredOutputs
StrengthsSingle-GPU capable, Good reasoning for size, Apache 2.0
TradeoffsText-only, lower ceiling than 235B

References

Official Links

Canonical launch, documentation, pricing, and release-note URLs.

Coverage

Benchmark Coverage

Reported benchmark families, versions, scores, sources, and notes.

Benchmark coverage
BenchmarkVersionScoreDateSourceNotes

Lifecycle

Release History

Lifecycle transitions and release timeline for this model family.

Release history
ReleaseAliasLifecycleRelease DateDeprecationShutdownSummary
Qwen3 32Bqwen3-32bActiveApr 29, 2025Current published model family snapshot.

Surfaces

Host Coverage

Provider-specific hosting, context, pricing notes, feature differences, and provider-status context.

Host coverage
HostTypeContextPricing NoteDifferences
Alibaba DashScopefirst-party131.1K$0.14/$0.56 per MTok.

Migration

Migration Guidance

Documented migration summary, successor families, and known breaking changes.

Migration guidance
TopicDetails
SummarySingle-GPU open reasoning model in this class.
Replacement modelsqwen3-235b-a22b

Timeline

Change Events

Cataloged model-family updates and source references.

Change events
DateTypeTitleDescriptionSource
Apr 29, 2025family_addedQwen3 32B publishedInitial public model family launch.Qwen3 32B release notes

From Qwen

Other models from Qwen