EvalEval Bot
EvalEvalBot
AI & ML interests
None yet
Recent Activity
new activity about 5 hours ago
evaleval/EEE_datastore:Add HELM AIR-Bench v1.16.0 results new activity about 8 hours ago
evaleval/EEE_datastore:[ACL Shared Task] Add AlpacaEval 1.0 and 2.0 leaderboard data (324 models) new activity about 10 hours ago
evaleval/EEE_datastore:[ACL Shared Task] Add SWE-bench Verified official leaderboard data