Xu

UCCCCCCCD

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 8 days ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published 14 days ago • 102

liked 2 models over 1 year ago

stabilityai/stable-diffusion-3-medium

Text-to-Image • Updated Aug 12, 2024 • 5.4k • 4.97k

meta-llama/Llama-2-7b-hf

Text Generation • 7B • Updated Apr 17, 2024 • 822k • 2.31k

upvoted a paper over 1 year ago

NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples

Paper • 2410.14669 • Published Oct 18, 2024 • 39

liked a model almost 2 years ago

GAIR/Anole-7b-v0.1

Updated Jul 14, 2024 • 21 • 123

liked a Space almost 2 years ago

MM-Vet Evaluator

Evaluate AI model predictions with correctness scores

liked a model almost 2 years ago

liuhaotian/llava-v1-0719-336px-lora-merge-vicuna-13b-v1.3

Text Generation • Updated Jul 19, 2023 • 69 • 9

liked a dataset almost 2 years ago

lmms-lab/llava-bench-coco

Viewer • Updated Mar 8, 2024 • 90 • 178 • 5

liked a model almost 2 years ago

perceptiveshawty/compositional-bert-large-uncased

Sentence Similarity • Updated Jul 19, 2024 • 10.4k • 2