Gopher
A 280B-parameter Transformer model from DeepMind, used to explore scaling laws and as a precursor to Chinchilla.
2021-12-01
280B
Decoder-only Transformer
Proprietary
Specifications
- Parameters
- 280B
- Architecture
- Decoder-only Transformer
- License
- Proprietary
- Context Window
- 2,048 tokens
- Type
- text
- Modalities
- text
Benchmark Scores
Advanced Specifications
- Model Family
- Gopher
- API Access
- Not Available
- Chat Interface
- Not Available