-
Hao0oWang/CurioSFT-Qwen2.5-Math-7B-RL
8B • Updated • 5 -
Hao0oWang/CurioSFT-Qwen2.5-Math-7B-SFT
8B • Updated -
Hao0oWang/CurioSFT_Data
Viewer • Updated • 63k • 18 -
Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models
Paper • 2602.02244 • Published • 1
Hao
Hao0oWang
·
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 months ago
Gen-Searcher: Reinforcing Agentic Search for Image Generation updated a collection 3 months ago
CurioSFTOrganizations
None yet