Zhongang Cai

caizhongang

sensenova

http://caizhongang.com/

AI & ML interests

Multimodal, Video Reasoning, Spatial Intelligence, Virtual Humans.

Recent Activity

upvoted a paper 3 days ago

Mage-Flow: An Efficient Native-Resolution Foundation Model for Image Generation and Editing

upvoted a paper 4 days ago

Apple-π: Benchmarking Thinking with Video Towards Law-Grounded Physical Intelligence

authored a paper 16 days ago

Vision as Unified Multimodal Generation

Papers 33

arxiv:2607.06560

arxiv:2605.12500

arxiv:2603.19227

arxiv:2603.16870

Spaces 1

SMPLer X

Models 1

caizhongang/SMPLer-X

Updated Jan 22 • 6

Datasets 3

caizhongang/SynBody

Updated Nov 4, 2024 • 310 • 6

caizhongang/HuMMan

Updated Oct 7, 2024 • 896 • 8

caizhongang/GTA-Human

Updated Oct 4, 2024 • 408 • 4