losclouds

AI Models / Compare

Qwen3 32B on Groq

Current Groq-hosted Qwen family for multilingual reasoning and tool-oriented chat.

Creator
Groq
Lifecycle
Active
Context
131.1K
Max output
32.8K
Released
Sep 1, 2025
Status
up
Input
$0.29 / 1M tokens
Output
$0.59 / 1M tokens
Cached read
/ 1M tokens
Cached write
/ 1M tokens
Batch discount
%
Source
Qwen3 32B on Groq pricing
Verified
Apr 2, 2026 (High)

Capabilities

Modalities
texttext
Capabilities
reasoningbatchSupportpromptCachingfunctionCallingstructuredOutputs
Strengths
Strong multilingual value, Fast hosted inference
Tradeoffs
Host-oriented page with limited creator-specific detail
Official Links

Benchmark Coverage

BenchmarkVersionScoreDateSourceNotes

Release History

ReleaseAliasLifecycleRelease DateDeprecationShutdownSummary
Qwen3 32B on Groqgroq-qwen3-32bActiveSep 1, 2025Current published model family snapshot.

Host Coverage

HostTypeContextPricing NoteDifferences
Groq APIfirst-party131.1KReference production Groq pricing.Production model tier
Migration Guidance

Open-model alternative when multilingual reasoning matters more than pure lowest latency.

Replacement models: groq-gpt-oss-120b

Change Events
DateTypeTitleDescriptionSource
Sep 1, 2025family_addedQwen3 32B on Groq publishedInitial public model family launch.Qwen3 32B on Groq release notes

Other models from Groq

GPT-OSS 120B on Groq, GPT-OSS 20B on Groq, Groq Compound, Llama 3.1 8B Instant on Groq, Llama 3.3 70B Versatile on Groq, Llama 4 Scout on Groq