EvalEval Bot
EvalEvalBot
Recent Activity
New activity about 7 hours ago in evaleval/EEE_datastore: Normalize schema versions to 0.2.2 and backfill canonical identity
Updated a dataset about 7 hours ago: evaleval/EEE_datastore
New activity about 7 hours ago in evaleval/EEE_datastore: [ACL Shared Task] Add CocoaBench aggregate results
Normalize schema versions to 0.2.2 and backfill canonical identity
🚀 2
6
#74 opened 1 day ago by yananlong
[ACL Shared Task] Add CocoaBench aggregate results
1
#75 opened about 8 hours ago by Cerru02
[ACL Shared Task] Add Multi-SWE-Bench and SWE-PolyBench leaderboard data
4
#72 opened 2 days ago by jatinganhotra
Add alphaXiv SOTA evaluations (27,976 records, 1,646 benchmarks)
10
#26 opened 2 months ago by simpod
Add AlpacaEval 1.0 and 2.0 leaderboard data (324 models)
7
#65 opened 8 days ago by karthikchundi
Add HELM AIR-Bench v1.16.0 results
4
#70 opened 7 days ago by yifanmai
[Submission] Fix win_rate scale (0-1) and merge Fibble variants into composite benchmark
1
#71 opened 6 days ago by drchangliu
[ACL Shared Task] Add AlpacaEval 1.0 and 2.0 leaderboard data (324 models)
1
#69 opened 7 days ago by karthikchundi
[ACL Shared Task] Add SWE-bench Verified official leaderboard data
11
#63 opened 9 days ago by jatinganhotra
[ACL Shared Task] Add BountyBench (DetectWorkflow) evaluation results
1
#67 opened 8 days ago by mrpfisher
Add HELM Capabilities v1.15.0 results
1
#64 opened 8 days ago by yifanmai
[ACL Shared Task] Add Artificial Analysis LLM results
2
#62 opened 11 days ago by Cerru02
[ACL Shared Task] Add Arcadia Impact Inspect evaluation results
🚀 2
6
#57 opened 15 days ago by mrpfisher
Parquet for dataset viewer
#59 opened 14 days ago by EvalEvalBot
Generating Parquets
2
#58 opened 14 days ago by EvalEvalBot
[ACL Shared Task] Add ARC-AGI leaderboard results
11
#55 opened 22 days ago by Cerru02
[ACL Shared Task] Add SciArena leaderboard results
8
#54 opened 23 days ago by Cerru02
[ACL Shared Task] Add Wordle Arena & Fibble Arena evaluation results
27
#35 opened about 1 month ago by drchangliu
[ACL Shared Task] Add BFCL leaderboard results
5
#56 opened 22 days ago by Cerru02
Upload Theory of Mind
4
#53 opened 24 days ago by SirGankalot