Chinchilla
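A 70B-parameter language model from DeepMind, trained on roughly 4× more data than the 280B-parameter Gopher at the same compute budget, which makes it compute-optimal. It outperformed much larger models such as Gopher and GPT-3 across a wide range of downstream tasks, supporting the scaling-law finding that model size and training tokens should be scaled together.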
2022-03-01
70B
Decoder-only Transformer
Proprietary
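The compute-optimal trade-off described above can be illustrated with a little arithmetic. The sketch below is a rough approximation under two assumptions that do not come from this page: training compute is estimated as C ≈ 6 · N · D (N parameters, D training tokens), and the compute-optimal ratio is taken as about 20 tokens per parameter, a rule of thumb commonly associated with Chinchilla. The function names are hypothetical.

```python
import math
from typing import Tuple

# Assumed rule-of-thumb constants (not taken from this page).
FLOPS_PER_PARAM_TOKEN = 6   # C ≈ 6 * N * D approximation of training compute
TOKENS_PER_PARAM = 20       # approximate compute-optimal tokens-per-parameter ratio


def training_flops(params: float, tokens: float) -> float:
    """Estimate training compute in FLOPs for a dense Transformer."""
    return FLOPS_PER_PARAM_TOKEN * params * tokens


def compute_optimal_split(flops: float) -> Tuple[float, float]:
    """Split a FLOP budget into (parameters, tokens) under the assumed ratio.

    With C = 6 * N * D and D = 20 * N, solving for N gives N = sqrt(C / 120).
    """
    params = math.sqrt(flops / (FLOPS_PER_PARAM_TOKEN * TOKENS_PER_PARAM))
    return params, TOKENS_PER_PARAM * params


if __name__ == "__main__":
    n_params = 70e9                          # 70B parameters, as listed on this page
    n_tokens = TOKENS_PER_PARAM * n_params   # tokens implied by the assumed ratio
    budget = training_flops(n_params, n_tokens)
    print(f"tokens: {n_tokens:.3e}, estimated FLOPs: {budget:.3e}")

    # Going the other way: recover the model size from the budget.
    opt_params, opt_tokens = compute_optimal_split(budget)
    print(f"optimal params: {opt_params:.3e}, optimal tokens: {opt_tokens:.3e}")
```

Under these assumptions a 70B-parameter model pairs with about 1.4 trillion training tokens. The constants are approximations, so treat the output as an order-of-magnitude estimate rather than a reported figure.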
Specifications
- Parameters: 70B
- Architecture: Decoder-only Transformer
- License: Proprietary
- Context Window: 2,048 tokens
- Type: text
- Modalities: text
Advanced Specifications
- Model Family: Chinchilla
- API Access: Not Available
- Chat Interface: Not Available