Qianhui WU
qianhuiwu
AI & ML interests
None yet
Recent Activity
upvoted an article about 3 hours ago
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond upvoted a paper 7 days ago
Orchard: An Open-Source Agentic Modeling Framework submitted a paper 7 days ago
Orchard: An Open-Source Agentic Modeling Framework