EvalEval Coalition

community

https://evalevalai.com/

evaluatingevals

Activity Feed Request to join this org

AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval)! Hosted by Hugging Face, University of Edinburgh, and EleutherAI.

Recent Activity

evijit updated a bucket about 6 hours ago

evaleval/general-eval-card-storage

j-chim updated a dataset about 8 hours ago

evaleval/entity-registry-data

evijit updated a dataset about 12 hours ago

evaleval/card_backend

View all activity

Papers

Every Eval Ever: A Unifying Schema and Community Repository for AI Evaluation Results

Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting

View all Papers

Articles

Introducing Evaluation Cards: A Live Interpretive Layer for Understanding the AI Evaluations Ecosystem

AI evals are becoming the new compute bottleneck

evaleval 's datasets 9

evaleval/entity-registry-data

Viewer • Updated about 8 hours ago • 229k • 919 • 1

evaleval/card_backend

Preview • Updated about 12 hours ago • 9.46k • 1

evaleval/auto-benchmarkcards

Viewer • Updated 15 days ago • 516 • 669 • 4

evaleval/EEE_datastore

Viewer • Updated 29 days ago • 4.89k • 9.05k • 37

evaleval/alphaxiv

Viewer • Updated Jun 27 • 15 • 1.08k

evaleval/HELM_datastore

Updated Jun 18 • 68

evaleval/EEE_datastore-flat-temp

Updated Jun 10 • 22

evaleval/alphaxiv_datastore

Updated Feb 20 • 30 • 1

evaleval/social_impact_eval_annotations

Viewer • Updated Nov 28, 2025 • 4.24k • 33 • 4