AI Models / Compare
Llama 3.1 Nemotron 70B Instruct
NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinfor
- Creator
- NVIDIA
- Lifecycle
- Active
- Context
- 131.1K
- Max output
- 16.4K
- Released
- Oct 15, 2024
- Status
- unknown
- Input
- $1.20 / 1M tokens
- Output
- $1.20 / 1M tokens
- Cached read
- — / 1M tokens
- Cached write
- — / 1M tokens
- Batch discount
- —%
- Source
- OpenRouter
- Verified
- Apr 5, 2026 (High)
Capabilities
- Modalities
- text→text
- Capabilities
- functionCallingstructuredOutputs
Other models from NVIDIA
Llama 3.1 Nemotron Ultra 253B v1, Llama 3.3 Nemotron Super 49B V1.5, Llama Nemotron Super 49B, Llama Nemotron Ultra 253B, Nemotron 3 Nano 30B A3B, Nemotron 3 Nano 30B A3B, Nemotron 3 Super, Nemotron 3 Super, Nemotron Nano 12B 2 VL, Nemotron Nano 12B 2 VL, Nemotron Nano 9B V2, Nemotron Nano 9B V2