Pricing
Model Overview
Tracked token-pricing fields for this model family. Empty pricing fields stay hidden until the source publishes them.
| Price field | Value |
|---|---|
| Input | $3.50 / 1M tokens |
| Output | $3.50 / 1M tokens |
| Source | Llama 3.1 405B pricing |
| Verified | Apr 5, 2026 (High) |
Surface
Capabilities
Input and output modalities, enabled feature flags, strengths, and tradeoffs.
| Attribute | Values |
|---|---|
| Modalities | text totext |
| Capabilities | batchSupportpromptCachingfunctionCallingstructuredOutputs |
| Strengths | Largest open-weight model, GPT-4-class quality |
| Tradeoffs | Needs multi-GPU, expensive to host |
References
Official Links
Canonical launch, documentation, pricing, and release-note URLs.
| Reference | URL |
|---|---|
| Intro | https://ai.meta.com/blog/meta-llama-3-1/ |
| Docs | https://llama.meta.com/docs/ |
| Pricing | https://llama.meta.com/docs/get-started/ |
| Release note | https://llama.meta.com/blog/ |
Coverage
Benchmark Coverage
Reported benchmark families, versions, scores, sources, and notes.
| Benchmark | Version | Score | Date | Source | Notes |
|---|---|---|---|---|---|
| MMLU | 2024 | 87.3 % | Jul 1, 2024 | Meta | Vendor-reported |
| HumanEval | 2024 | 89 % | Jul 1, 2024 | Meta | Vendor-reported |
Lifecycle
Release History
Lifecycle transitions and release timeline for this model family.
| Release | Alias | Lifecycle | Release Date | Deprecation | Shutdown | Summary |
|---|---|---|---|---|---|---|
| Llama 3.1 405B | llama-3-1-405b | Active | Jul 23, 2024 | — | — | Current published model family snapshot. |
Surfaces
Host Coverage
Provider-specific hosting, context, pricing notes, feature differences, and provider-status context.
| Host | Type | Context | Pricing Note | Differences |
|---|---|---|---|---|
| Together AI | cloud | 128.0K | $3.50/$3.50 per MTok. | — |
| DeepInfra | cloud | 128.0K | $1.79/$1.79 per MTok. | — |
Migration
Migration Guidance
Documented migration summary, successor families, and known breaking changes.
| Topic | Details |
|---|---|
| Summary | Use Llama 4 Maverick for better quality with cheaper MoE hosting. |
| Replacement models | llama-4-maverick |
Timeline
Change Events
Cataloged model-family updates and source references.
| Date | Type | Title | Description | Source |
|---|---|---|---|---|
| Jul 23, 2024 | family_added | Llama 3.1 405B published | Initial public model family launch. | Llama 3.1 405B release notes |
From Meta AI