Vibe-Eval
stylePending Verification
Evaluates models on their ability to understand and generate content with specific vibes or styles.
Published: 2024
Score Range: 0-100
Top Score: 65.6
Vibe-Eval Leaderboard
| Rank | Model | Provider | Score | Parameters | Released | Type |
|---|---|---|---|---|---|---|
| 1 | Gemini 2.5 Pro | 65.6 | 2025-05-06 | Multimodal | ||
| 2 | Gemini 2.5 Flash | 65.4 | 2025-05-20 | Multimodal | ||
| 3 | Gemini 2.5 Flash-Lite | 57.5 | 2025-06-17 | Multimodal |
About Vibe-Eval
Methodology
Vibe-Eval evaluates model performance using a standardized scoring methodology. Scores are reported on a scale of 0 to 100, where higher scores indicate better performance. For detailed information about the scoring system and methodology, please refer to the original paper.
Publication
This benchmark was published in 2024.Technical Paper