Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published 12 days ago • 85
Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes Paper • 2603.25562 • Published about 1 month ago • 13
view article Article Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️ Jan 4, 2025 • 9
Data Science and Technology Towards AGI Part I: Tiered Data Management Paper • 2602.09003 • Published Feb 9 • 7