VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation Paper • 2605.16079 • Published 13 days ago • 28
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards Paper • 2605.10899 • Published 17 days ago • 75
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 20 days ago • 97
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 24 days ago • 341
Paused Agents Featured 158 daVinci-MagiHuman 🎬 158 Generate short videos from an image and text prompt
All Roads Lead to Rome: Incentivizing Divergent Thinking in Vision-Language Models Paper • 2604.00479 • Published Apr 1 • 69