Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games Paper • 2606.19338 • Published 1 day ago • 33
LoMo: Local Modality Substitution for Deeper Vision-Language Fusion Paper • 2605.30265 • Published 21 days ago • 23
DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes Paper • 2605.28421 • Published 22 days ago • 47
ACC: Compiling Agent Trajectories for Long-Context Training Paper • 2605.21850 • Published 28 days ago • 60
Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning Paper • 2604.05404 • Published Apr 7 • 44
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published Mar 23 • 125
Running 24 ConStellaration Design Leaderboard 🔋 24 Explore the ConStellaration boundary leaderboard
MOVA: Towards Scalable and Synchronized Video-Audio Generation Paper • 2602.08794 • Published Feb 9 • 159