EvalEval Bot
EvalEvalBot
AI & ML interests
None yet
Recent Activity
new activity about 3 hours ago
evaleval/EEE_datastore: [ACL Shared Task] Add Multi-SWE-Bench and SWE-PolyBench leaderboard data
new activity 3 days ago
evaleval/EEE_datastore: Add alphaXiv SOTA evaluations (27,976 records, 1,646 benchmarks)
new activity 3 days ago
evaleval/EEE_datastore: Add AlpacaEval 1.0 and 2.0 leaderboard data (324 models)