Vibe-Eval

stylePending Verification

Evaluates models on their ability to understand and generate content with specific vibes or styles.

Published: 2024
Score Range: 0-100
Top Score: 65.6

Vibe-Eval Leaderboard

RankModelProviderScoreParametersReleasedType
1Gemini 2.5 ProGoogle
65.6
2025-05-06Multimodal
2Gemini 2.5 FlashGoogle
65.4
2025-05-20Multimodal
3Gemini 2.5 Flash-LiteGoogle
57.5
2025-06-17Multimodal

About Vibe-Eval

Methodology

Vibe-Eval evaluates model performance using a standardized scoring methodology. Scores are reported on a scale of 0 to 100, where higher scores indicate better performance. For detailed information about the scoring system and methodology, please refer to the original paper.

Publication

This benchmark was published in 2024.Read the full paper