Google logo

Gemini 2.5 Flash-Lite

GoogleProprietaryVerified

Google's most cost-efficient and fastest 2.5 model yet. Higher quality than 2.0 Flash-Lite on coding, math, science, reasoning and multimodal benchmarks. Excels at high-volume, latency-sensitive tasks like translation and classification.

2025-06-17
Mixture of Experts
Proprietary

Specifications

Architecture
Mixture of Experts
License
Proprietary
Context Window
1,000,000 tokens
Type
multimodal
Modalities
textimagevideoaudio

Benchmark Scores

A challenging benchmark of novel problems designed to test the limits of AI capabilities....

GPQA66.7

Graduate-level Problems in Quantitative Analysis (GPQA) evaluates advanced reasoning on graduate-lev...

American Invitational Mathematics Examination (AIME) 2025 problems....

Evaluates models on their ability to solve coding problems in real-time....

Tests models on their ability to write code in multiple programming languages....

Software Engineering Benchmark (SWE-bench) evaluates models on real-world software engineering tasks...

A benchmark of simple but precise questions to test factual knowledge and reasoning....

The FACTS Grounding Leaderboard evaluates LLMs' ability to generate factually accurate long-form res...

MMMU72.9

A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI with 11.5...

Evaluates models on their ability to understand and generate content with specific vibes or styles....

MRCR (Multi-Round Coreference Resolution) is part of the Michelangelo benchmark suite that evaluates...

MRCR (Multi-Round Coreference Resolution) is part of the Michelangelo benchmark suite that evaluates...

A balanced collection of culturally sensitive and culturally agnostic MMLU tasks designed for effici...

Advanced Specifications

Model Family
Gemini
API Access
Available
Chat Interface
Available
Multilingual Support
Yes

Capabilities & Limitations

Capabilities
cost-efficient performancefast inferencereasoningcodemathsciencemultimodal understandinglong contexttranslationclassificationthinking mode
Notable Use Cases
high-volume taskslatency-sensitive applicationstranslationclassificationcoding assistance
Tool Use Support
Yes

Related Models