mj
mujianijan
ยท
AI & ML interests
RL, LLM Agent
Recent Activity
authored a paper 1 day ago
GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents authored a paper 1 day ago
DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn Optimization