Q-GeoMem: Question-Guided Geometric Memory for Video Spatial Reasoning Paper • 2605.27318 • Published 11 days ago • 1
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control Paper • 2406.16038 • Published Jun 23, 2024 • 1
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control Paper • 2508.21112 • Published Aug 28, 2025 • 78
MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation Paper • 2503.11081 • Published Mar 14, 2025
Q-GeoMem: Question-Guided Geometric Memory for Video Spatial Reasoning Paper • 2605.27318 • Published 11 days ago • 1
EO-Robotics Collection EmbodiedOneVision is a unified framework for multimodal embodied reasoning and robot control, featuring interleaved vision-text-action pretraining. • 7 items • Updated Mar 2 • 8
EO-Robotics Collection EmbodiedOneVision is a unified framework for multimodal embodied reasoning and robot control, featuring interleaved vision-text-action pretraining. • 7 items • Updated Mar 2 • 8
MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback Paper • 2505.17873 • Published May 23, 2025 • 30