Running 168 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 168 Building and scaling RL environments for LLM training
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 22 days ago • 337
GUI-G^2: Gaussian Reward Modeling for GUI Grounding Paper • 2507.15846 • Published Jul 21, 2025 • 135