JQL: Judging Quality Across Languages 🦊 Filter multilingual data for high-quality language models
Arabic Broad Leaderboard (ABL) 🥇 NextGen Evaluation Benchmark and Leaderboard for Arabic LLMs