losclouds
Model · NVIDIA

Llama 3.3 Nemotron Super 49B V1.5

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG,

Pricing

Model Overview

Tracked token-pricing fields for this model family. Empty pricing fields stay hidden until the source publishes them.

Model pricing
Price fieldValue
Input$0.10 / 1M tokens
Output$0.40 / 1M tokens
SourceOpenRouter
VerifiedApr 5, 2026 (High)

Surface

Capabilities

Input and output modalities, enabled feature flags, strengths, and tradeoffs.

Model capabilities
AttributeValues
Modalities
text
to
text
Capabilities
reasoningfunctionCallingstructuredOutputs

References

Official Links

Canonical launch, documentation, pricing, and release-note URLs.

Coverage

Benchmark Coverage

Reported benchmark families, versions, scores, sources, and notes.

Benchmark coverage
BenchmarkVersionScoreDateSourceNotes

Lifecycle

Release History

Lifecycle transitions and release timeline for this model family.

Release history
ReleaseAliasLifecycleRelease DateDeprecationShutdownSummary
Llama 3.3 Nemotron Super 49B V1.5nvidia-llama-3-3-nemotron-super-49b-v1-5ActiveOct 10, 2025Model available via OpenRouter.

Surfaces

Host Coverage

Provider-specific hosting, context, pricing notes, feature differences, and provider-status context.

Host coverage
HostTypeContextPricing NoteDifferences
OpenRouteraggregator131.1K$0.10/1M in · $0.40/1M out via OpenRouter

Migration

Migration Guidance

Documented migration summary, successor families, and known breaking changes.

Migration guidance
TopicDetails
Summary

Timeline

Change Events

Cataloged model-family updates and source references.

Change events
DateTypeTitleDescriptionSource
Oct 10, 2025family_addedLlama 3.3 Nemotron Super 49B V1.5 publishedModel made available via OpenRouter.OpenRouter

From NVIDIA

Other models from NVIDIA