losclouds
Model · NVIDIA

Llama 3.1 Nemotron Ultra 253B v1

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Me

Pricing

Model Overview

Tracked token-pricing fields for this model family. Empty pricing fields stay hidden until the source publishes them.

Model pricing
Price fieldValue
Input$0.60 / 1M tokens
Output$1.80 / 1M tokens
SourceOpenRouter
VerifiedApr 5, 2026 (High)

Surface

Capabilities

Input and output modalities, enabled feature flags, strengths, and tradeoffs.

Model capabilities
AttributeValues
Modalities
text
to
text
Capabilities
reasoningstructuredOutputs

References

Official Links

Canonical launch, documentation, pricing, and release-note URLs.

Coverage

Benchmark Coverage

Reported benchmark families, versions, scores, sources, and notes.

Benchmark coverage
BenchmarkVersionScoreDateSourceNotes

Lifecycle

Release History

Lifecycle transitions and release timeline for this model family.

Release history
ReleaseAliasLifecycleRelease DateDeprecationShutdownSummary
Llama 3.1 Nemotron Ultra 253B v1nvidia-llama-3-1-nemotron-ultra-253b-v1ActiveApr 8, 2025Model available via OpenRouter.

Surfaces

Host Coverage

Provider-specific hosting, context, pricing notes, feature differences, and provider-status context.

Host coverage
HostTypeContextPricing NoteDifferences
OpenRouteraggregator131.1K$0.60/1M in · $1.80/1M out via OpenRouter

Migration

Migration Guidance

Documented migration summary, successor families, and known breaking changes.

Migration guidance
TopicDetails
Summary

Timeline

Change Events

Cataloged model-family updates and source references.

Change events
DateTypeTitleDescriptionSource
Apr 8, 2025family_addedLlama 3.1 Nemotron Ultra 253B v1 publishedModel made available via OpenRouter.OpenRouter

From NVIDIA

Other models from NVIDIA