losclouds

AI Models / Compare

Llama 3.1 Nemotron Ultra 253B v1

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Me

Creator
NVIDIA
Lifecycle
Active
Context
131.1K
Max output
Released
Apr 8, 2025
Status
unknown
Input
$0.60 / 1M tokens
Output
$1.80 / 1M tokens
Cached read
/ 1M tokens
Cached write
/ 1M tokens
Batch discount
%
Source
OpenRouter
Verified
Apr 5, 2026 (High)

Capabilities

Modalities
texttext
Capabilities
reasoningstructuredOutputs
Official Links

Benchmark Coverage

BenchmarkVersionScoreDateSourceNotes

Release History

ReleaseAliasLifecycleRelease DateDeprecationShutdownSummary
Llama 3.1 Nemotron Ultra 253B v1nvidia-llama-3-1-nemotron-ultra-253b-v1ActiveApr 8, 2025Model available via OpenRouter.

Host Coverage

HostTypeContextPricing NoteDifferences
OpenRouteraggregator131.1K$0.60/1M in · $1.80/1M out via OpenRouter
Migration Guidance

Change Events
DateTypeTitleDescriptionSource
Apr 8, 2025family_addedLlama 3.1 Nemotron Ultra 253B v1 publishedModel made available via OpenRouter.OpenRouter

Other models from NVIDIA

Llama 3.1 Nemotron 70B Instruct, Llama 3.3 Nemotron Super 49B V1.5, Llama Nemotron Super 49B, Llama Nemotron Ultra 253B, Nemotron 3 Nano 30B A3B, Nemotron 3 Nano 30B A3B, Nemotron 3 Super, Nemotron 3 Super, Nemotron Nano 12B 2 VL, Nemotron Nano 12B 2 VL, Nemotron Nano 9B V2, Nemotron Nano 9B V2