Llama 3.1 (405B)
Largest model in Meta's third-generation Llama family, with 405B parameters and reportedly trained on about 15.6T tokens. It extended the context window to 128k tokens (up from 8k in Llama 3) and expanded multilingual support. At release, it was the largest openly available model.
2024-07-01
Specifications
- Parameters: 405B
- Architecture: Decoder-only Transformer
- License: Llama 3 License
- Context Window: 128,000 tokens
- Type: text
- Modalities: text
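As a rough illustration of what the 405B parameter count means in practice, the sketch below estimates the storage needed for the weights alone at a few common numeric precisions. The helper name `weight_memory_gb` and the precision table are assumptions introduced here for illustration, not part of the model card. The estimate ignores activations, the KV cache, and any runtime overhead, so real deployments will likely need more.

```python
# Back-of-the-envelope estimate of weight storage for a 405B-parameter model.
# Assumption: every parameter is stored at the same precision; activations,
# KV cache, and framework overhead are NOT included.

PARAMS = 405e9  # parameter count taken from the specifications

# Standard storage sizes in bytes per parameter (int4 packs two values per byte).
BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "int8": 1, "int4": 0.5}


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Return weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


if __name__ == "__main__":
    for dtype, size in BYTES_PER_PARAM.items():
        print(f"{dtype}: ~{weight_memory_gb(PARAMS, size):,.1f} GB")
```

Even at 16-bit precision the weights come to roughly 810 GB, which is why a model of this size is typically sharded across multiple accelerators or quantized before serving.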
Advanced Specifications
- Model Family: Llama
- API Access: Not Available
- Chat Interface: Not Available