losclouds

AI Models / Compare

Llama 3.3 Nemotron Super 49B V1.5

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG,

Creator
NVIDIA
Lifecycle
Active
Context
131.1K
Max output
Released
Oct 10, 2025
Status
unknown
Input
$0.10 / 1M tokens
Output
$0.40 / 1M tokens
Cached read
/ 1M tokens
Cached write
/ 1M tokens
Batch discount
%
Source
OpenRouter
Verified
Apr 5, 2026 (High)

Capabilities

Modalities
texttext
Capabilities
reasoningfunctionCallingstructuredOutputs
Official Links

Benchmark Coverage

BenchmarkVersionScoreDateSourceNotes

Release History

ReleaseAliasLifecycleRelease DateDeprecationShutdownSummary
Llama 3.3 Nemotron Super 49B V1.5nvidia-llama-3-3-nemotron-super-49b-v1-5ActiveOct 10, 2025Model available via OpenRouter.

Host Coverage

HostTypeContextPricing NoteDifferences
OpenRouteraggregator131.1K$0.10/1M in · $0.40/1M out via OpenRouter
Migration Guidance

Change Events
DateTypeTitleDescriptionSource
Oct 10, 2025family_addedLlama 3.3 Nemotron Super 49B V1.5 publishedModel made available via OpenRouter.OpenRouter

Other models from NVIDIA

Llama 3.1 Nemotron 70B Instruct, Llama 3.1 Nemotron Ultra 253B v1, Llama Nemotron Super 49B, Llama Nemotron Ultra 253B, Nemotron 3 Nano 30B A3B, Nemotron 3 Nano 30B A3B, Nemotron 3 Super, Nemotron 3 Super, Nemotron Nano 12B 2 VL, Nemotron Nano 12B 2 VL, Nemotron Nano 9B V2, Nemotron Nano 9B V2