Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Hyzhao's picture
Open to Collab
2

Hyzhao

HaoyuZhao
·
https://haoyu-zhao.github.io/

AI & ML interests

Multimodal LLM, LLM Agent

Recent Activity

updated a Space 5 days ago
HaoyuZhao/memento-arc
published a Space 5 days ago
HaoyuZhao/memento-arc
upvoted a paper 14 days ago
Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction
View all activity

Organizations

University College London's profile picture Multi-Modality-Safety's profile picture MM-Reasoner's profile picture Memento-ARC's profile picture

authored 5 papers 2 months ago

Ref-NeuS: Ambiguity-Reduced Neural Implicit Surface Learning for Multi-View Reconstruction with Reflection

Paper • 2303.10840 • Published Mar 20, 2023 • 1

GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs

Paper • 2412.11258 • Published Dec 15, 2024 • 13

AccidentBench: Benchmarking Multimodal Understanding and Reasoning in Vehicle Accidents and Beyond

Paper • 2509.26636 • Published Sep 30, 2025 • 1

See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm

Paper • 2512.08629 • Published Dec 9, 2025 • 1

LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding

Paper • 2405.17104 • Published May 27, 2024
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs