Sunhaoyu770
sunhaoyu770
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 hour ago
Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization
upvoted a paper 2 days ago
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence
liked a dataset 2 days ago
bihungba1101/grammar-accuracy-qwen3.5-4b-trl-completions
Organizations
None yet