Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity
Zhang Xingjian
Zhang199
AI & ML interests
Large Multimodal Models
Recent Activity
upvoted a paper 9 days ago
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents new activity 8 months ago
Zhang199/TinyLLaVA-Video-R1:Extend length of video which can be processed? updated a model 9 months ago
Zhang199/TinyLLaVA-Qwen2-0.5B-SigLIPOrganizations
None yet