Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 10 days ago • 419
Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published about 1 month ago • 233
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published 23 days ago • 145
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 25 days ago • 195
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 24 days ago • 270
Prompt-Activation Duality: Improving Activation Steering via Attention-Level Interventions Paper • 2605.10664 • Published 26 days ago • 9
PhyCo: Learning Controllable Physical Priors for Generative Motion Paper • 2604.28169 • Published Apr 30 • 13
Mind's Eye: A Benchmark of Visual Abstraction, Transformation and Composition for Multimodal LLMs Paper • 2604.16054 • Published Apr 17 • 1
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published Apr 8 • 121