arxiv:2606.05833
WANG HAIBO
WHB139426
AI & ML interests
None yet
Recent Activity
authored a paper about 10 hours ago
Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding
authored a paper about 10 hours ago
Learning Geometric Representations from Videos for Spatial Intelligent Multimodal Large Language Models
updated a model 2 days ago
WHB139426/GeoVR
Organizations
None yet