losclouds

AI Models / Compare

Llama 3.1 Nemotron 70B Instruct

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinfor

Creator
NVIDIA
Lifecycle
Active
Context
131.1K
Max output
16.4K
Released
Oct 15, 2024
Status
unknown
Input
$1.20 / 1M tokens
Output
$1.20 / 1M tokens
Cached read
/ 1M tokens
Cached write
/ 1M tokens
Batch discount
%
Source
OpenRouter
Verified
Apr 5, 2026 (High)

Capabilities

Modalities
texttext
Capabilities
functionCallingstructuredOutputs
Official Links

Benchmark Coverage

BenchmarkVersionScoreDateSourceNotes

Release History

ReleaseAliasLifecycleRelease DateDeprecationShutdownSummary
Llama 3.1 Nemotron 70B Instructnvidia-llama-3-1-nemotron-70b-instructActiveOct 15, 2024Model available via OpenRouter.

Host Coverage

HostTypeContextPricing NoteDifferences
OpenRouteraggregator131.1K$1.20/1M in · $1.20/1M out via OpenRouter
Migration Guidance

Change Events
DateTypeTitleDescriptionSource
Oct 15, 2024family_addedLlama 3.1 Nemotron 70B Instruct publishedModel made available via OpenRouter.OpenRouter

Other models from NVIDIA

Llama 3.1 Nemotron Ultra 253B v1, Llama 3.3 Nemotron Super 49B V1.5, Llama Nemotron Super 49B, Llama Nemotron Ultra 253B, Nemotron 3 Nano 30B A3B, Nemotron 3 Nano 30B A3B, Nemotron 3 Super, Nemotron 3 Super, Nemotron Nano 12B 2 VL, Nemotron Nano 12B 2 VL, Nemotron Nano 9B V2, Nemotron Nano 9B V2