losclouds
Model · NVIDIA

Llama Nemotron Ultra 253B

NVIDIA flagship reasoning model with vendor-reported GPQA and AIME results in the open-weight class.

Pricing

Model Overview

Tracked token-pricing fields for this model family. Empty pricing fields stay hidden until the source publishes them.

Model pricing
Price fieldValue
Input$0.60 / 1M tokens
Output$1.80 / 1M tokens
SourceLlama Nemotron Ultra 253B pricing
VerifiedApr 5, 2026 (High)

Surface

Capabilities

Input and output modalities, enabled feature flags, strengths, and tradeoffs.

Model capabilities
AttributeValues
Modalities
text
to
text
Capabilities
reasoningbatchSupportpromptCachingfunctionCallingstructuredOutputs
StrengthsFrontier reasoning quality open-weight, Vendor-reported GPQA result
TradeoffsNeeds 4×B100 or 8×H100 to self-host

References

Official Links

Canonical launch, documentation, pricing, and release-note URLs.

Coverage

Benchmark Coverage

Reported benchmark families, versions, scores, sources, and notes.

Benchmark coverage
BenchmarkVersionScoreDateSourceNotes
GPQA202476.01 %Apr 1, 2025NVIDIAReasoning ON, vendor-reported
AIME 2025202572.5 %Apr 1, 2025NVIDIAReasoning ON, vendor-reported
MATH-500202497 %Apr 1, 2025NVIDIAVendor-reported

Lifecycle

Release History

Lifecycle transitions and release timeline for this model family.

Release history
ReleaseAliasLifecycleRelease DateDeprecationShutdownSummary
Llama Nemotron Ultra 253Bnemotron-ultra-253bActiveApr 7, 2025Current published model family snapshot.

Surfaces

Host Coverage

Provider-specific hosting, context, pricing notes, feature differences, and provider-status context.

Host coverage
HostTypeContextPricing NoteDifferences
NVIDIA NIMfirst-party131.1K$0.60/$1.80 per MTok.Thinking mode toggle; Multilingual

Migration

Migration Guidance

Documented migration summary, successor families, and known breaking changes.

Migration guidance
TopicDetails
SummaryOpen-weight reasoning model for quality-first workloads.

Timeline

Change Events

Cataloged model-family updates and source references.

Change events
DateTypeTitleDescriptionSource
Apr 7, 2025family_addedLlama Nemotron Ultra 253B publishedInitial public model family launch.Llama Nemotron Ultra 253B release notes

From NVIDIA

Other models from NVIDIA