Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Vatsal Agarwal's picture
2 6 1

Vatsal Agarwal

vatsalag
Meghatron's profile picture
·

AI & ML interests

None yet

Recent Activity

liked a dataset 5 days ago
facebook/wearable-ai
upvoted a paper about 2 months ago
RoboMME: Benchmarking and Understanding Memory for Robotic Generalist Policies
authored a paper 2 months ago
Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory
View all activity

Organizations

UMD Tech+Research 23's profile picture

authored 2 papers 2 months ago

Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory

Paper • 2602.18434 • Published Feb 20

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

Paper • 2603.14145 • Published Mar 14 • 14
authored 4 papers 11 months ago

Do text-free diffusion models learn discriminative visual representations?

Paper • 2311.17921 • Published Nov 29, 2023 • 1

Diffusion Models Beat GANs on Image Classification

Paper • 2307.08702 • Published Jul 17, 2023 • 19

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Paper • 2409.06703 • Published Sep 10, 2024 • 3

Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor

Paper • 2507.07106 • Published Jul 9, 2025 • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs