Vibe-Eval

stylePending Verification

Evaluates models on their ability to understand and generate content with specific vibes or styles.

Published: 2024
Score Range: 0-100
Top Score: 65.6

Vibe-Eval Leaderboard

RankModelProviderScoreParametersReleasedType
1Gemini 2.5 ProGoogle
65.6
2025-05-06Multimodal
2Gemini 2.5 FlashGoogle
65.4
2025-05-20Multimodal
3Gemini 2.5 Flash-LiteGoogle
57.5
2025-06-17Multimodal

About Vibe-Eval

Methodology

Vibe-Eval evaluates model performance using a standardized scoring methodology. Scores are reported on a scale of 0 to 100, where higher scores indicate better performance. For detailed information about the scoring system and methodology, please refer to the original paper.

Publication

This benchmark was published in 2024.Technical Paper