Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
FSMBench
university
Activity Feed
Follow
7
AI & ML interests
Evaluating and Benchmarking Large Multimodal Models
Recent Activity
taesiri
submitted
a paper
about 18 hours ago
dWorldEval: Scalable Robotic Policy Evaluation via Discrete Diffusion World Model
taesiri
submitted
a paper
about 18 hours ago
AgentSearchBench: A Benchmark for AI Agent Search in the Wild
taesiri
submitted
a paper
about 18 hours ago
Learning Evidence Highlighting for Frozen LLMs
View all activity
Team members
5
FSMBench
's models
None public yet