GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published Apr 29 • 112
Beyond Mode Collapse: Distribution Matching for Diverse Reasoning Paper • 2605.19461 • Published May 19 • 2
HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation Paper • 2603.23871 • Published Mar 25 • 1
Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why Paper • 2605.10889 • Published May 11 • 6
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published Apr 1 • 56
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence Paper • 2604.24954 • Published Apr 27 • 26