LMMs-Lab-Audio

community

Activity Feed Request to join this org

AI & ML interests

Feeling and building the multimodal intelligence

Recent Activity

mwxely authored a paper 3 days ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

mwxely authored a paper 4 days ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

mwxely authored a paper 11 days ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

View all activity

lmms-lab-audio 's datasets 3

lmms-lab-audio/timit-tts

Updated Feb 15 • 3

lmms-lab-audio/song-describer

Viewer • Updated Feb 13 • 1.85k • 36

lmms-lab-audio/europal-asr

Viewer • Updated Feb 13 • 215 • 12