Toto 2.0: Time Series Forecasting Enters the Scaling Era Paper • 2605.20119 • Published 4 days ago • 34
Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos Paper • 2605.18233 • Published 5 days ago • 87
GATES: Self-Distillation under Privileged Context with Consensus Gating Paper • 2602.20574 • Published Feb 24 • 1
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 11 days ago • 186
Generative Modeling with Orbit-Space Particle Flow Matching Paper • 2605.02222 • Published 19 days ago • 9
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 19 days ago • 333
When Do Diffusion Models learn to Generate Multiple Objects? Paper • 2605.00273 • Published 23 days ago • 9
Map2World: Segment Map Conditioned Text to 3D World Generation Paper • 2605.00781 • Published 22 days ago • 25
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published 22 days ago • 84
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation Paper • 2604.24763 • Published 26 days ago • 71
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published 26 days ago • 118
Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets Paper • 2604.22294 • Published 29 days ago • 17