Gemini 2.0 Pro
Google's strongest model to date for coding and complex prompts, with better understanding of and reasoning about world knowledge than any previous release. It offers a 2-million-token context window and can call tools such as Google Search and code execution.
Specifications
- Architecture: Mixture-of-Experts Multimodal Transformer
- License: Proprietary
- Context Window: 2,000,000 tokens
- Type: multimodal
- Modalities: text, image, video, audio
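To make the 2,000,000-token context window concrete, here is a minimal sketch of a pre-flight check that estimates whether a prompt is likely to fit. It does not call any Gemini API. The characters-per-token ratio, the reserved output budget, and the function names are illustrative assumptions, not values from the model's documentation. A real tokenizer will give different counts.

```python
# Context window size taken from the specification list above.
CONTEXT_WINDOW_TOKENS = 2_000_000

# Assumption: roughly 4 characters per token. Real tokenizers vary
# with language and content, so treat this as a coarse heuristic.
ASSUMED_CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Rough token estimate from character count (not a real tokenizer)."""
    # Ceiling division so that any non-empty text counts as at least one token.
    return -(-len(text) // ASSUMED_CHARS_PER_TOKEN)


def fits_in_context(text: str, reserved_for_output: int = 8_192) -> bool:
    """Return True if the estimated prompt plus a reserved output budget fits.

    `reserved_for_output` is a hypothetical budget chosen for illustration.
    """
    return estimate_tokens(text) + reserved_for_output <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    prompt = "Summarize the following repository. " * 1_000
    print(estimate_tokens(prompt), fits_in_context(prompt))
```

For anything close to the limit, an exact count from the provider's own token-counting facility is the safer choice, since the heuristic can be off by a wide margin for code or non-English text.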
Benchmark Scores
MMLU-Pro is an enhanced benchmark with over 12,000 challenging questions across 14 domains including...
GPQA (Graduate-Level Google-Proof Q&A) evaluates advanced reasoning on graduate-lev...
A dataset of 12,500 challenging competition mathematics problems requiring multi-step reasoning....
Massive Multi-discipline Multimodal Understanding (MMMU) evaluates multimodal understanding across 3...
BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation) is a cross-domain dataset ...
A balanced collection of culturally sensitive and culturally agnostic MMLU tasks designed for effici...
MRCR (Multi-Round Coreference Resolution) is part of the Michelangelo benchmark suite that evaluates...
CoVoST 2 is an open, large-scale multilingual speech-to-text translation (ST) dataset developed to a...
EgoSchema is a very long-form video question-answering dataset and benchmark for evaluating long vid...
Advanced Specifications
- Model Family: Gemini
- API Access: Available
- Chat Interface: Available
- Multilingual Support: Yes
Capabilities & Limitations
- Capabilities: superior coding performance, complex prompt handling, advanced reasoning, world knowledge, tool use, long context
- Tool Use Support: Yes