arxiv:2505.16933
Zebin You
yyyou
AI & ML interests
Multimodal learning, generative models
Recent Activity
upvoted a paper 1 day ago
Improved Large Language Diffusion Models
upvoted a paper 17 days ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
upvoted a paper 17 days ago
Rethinking the Divergence Regularization in LLM RL