Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding? Paper • 2606.08063 • Published 11 days ago • 76
FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents Paper • 2606.12087 • Published 7 days ago • 72
Claw-SWE-Bench: A Benchmark for Evaluating OpenClaw-style Agent Harnesses on Coding Tasks Paper • 2606.12344 • Published 7 days ago • 65
Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models Paper • 2606.03988 • Published 14 days ago • 119
AlphaTransit: Learning to Design City-scale Transit Routes Paper • 2605.28730 • Published 21 days ago • 7
DarkForest: Less Talk, Higher Accuracy for Multi-Agent LLMs Paper • 2605.25188 • Published 24 days ago • 16
PANDO: Efficient Multimodal AI Agents via Online Skill Distillation Paper • 2605.24785 • Published 22 days ago • 11
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 21 days ago • 423