DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 4 days ago • 190
A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook Paper • 2605.20266 • Published 6 days ago • 56
nodogoro/cell2_20260521_hossam_coffee_shop_setting20260521_223750 Viewer • Updated 2 days ago • 2.48k • 21 • 1
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published 24 days ago • 57
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 24 days ago • 218
DCAgent2/swebench_verified_random_100_folders_nemotron_terminal_software_engineering__Qw16972da0 Viewer • Updated Apr 14 • 300 • 8
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding Paper • 2604.05015 • Published Apr 6 • 235