jian
lipliu
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 13 hours ago
Self-Distilled Agentic Reinforcement Learning upvoted a paper about 13 hours ago
Flow-OPD: On-Policy Distillation for Flow Matching Models upvoted a paper about 14 hours ago
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable RewardsOrganizations
None yet