Pixtral 123B
A 123B-parameter **multimodal** model from Mistral, combining language with vision ("Pix-"). A 12B smaller version was released under Apache 2.0. Pixtral extends Mistral's LLM capability to image understanding and description tasks.
2024-11-01
123B
Multimodal Transformer (vision + text)
Mistral Research License (for 123B)
Specifications
- Parameters
- 123B
- Architecture
- Multimodal Transformer (vision + text)
- License
- Mistral Research License (for 123B)
- Context Window
- 8,192 tokens
- Type
- multimodal
- Modalities
- textvision
Benchmark Scores
Advanced Specifications
- Model Family
- Pixtral
- API Access
- Not Available
- Chat Interface
- Not Available