Huanxin Sheng
HuanxinSheng
ยท
AI & ML interests
None yet
Recent Activity
commentedon a paper about 18 hours ago
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe commentedon a paper about 19 hours ago
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation upvoted a paper about 19 hours ago
Self-Distillation Enables Continual Learning