OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis
Paper • 2604.15093 • Published • 28
Computer Vision
RIVER: A Real-Time Interaction Benchmark for Video LLMs
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision