Running on Zero Agents Featured 20 Khala — High-Fidelity Song Generation 🎧 20 Generate high-fidelity songs from text prompts
Running on Zero Agents Featured 2.5k Qwen Image Multiple Angles 3D Camera 🎥 2.5k Edit image camera angle with interactive 3D controls
Running on Zero MCP 83 Qwen Image Edit 2509 LoRAs Fast ⚡ 83 Demo of the Collection of Qwen Image Editing LoRAs
MMSkills: Towards Multimodal Skills for General Visual Agents Paper • 2605.13527 • Published 11 days ago • 117
Flow-OPD: On-Policy Distillation for Flow Matching Models Paper • 2605.08063 • Published 17 days ago • 97
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 21 days ago • 336
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published Apr 13 • 143
All Roads Lead to Rome: Incentivizing Divergent Thinking in Vision-Language Models Paper • 2604.00479 • Published Apr 1 • 69