AI2 WildBench Leaderboard (V2)
🦁
232
Display LLM performance leaderboards with customizable views
Display LLM performance leaderboards with customizable views
Note The leaderboard for visualizing the results and collecting human feedback.
Note Examples for evaluating LLMs.
Note The model outputs for verified LLMs on the leaderboard.