Yang Shi

DogNeverSleep

15 48 2

https://FrankYang-17.github.io/

FrankYang-17

AI & ML interests

👨🏻‍🎓PhD student at Peking University

Recent Activity

authored a paper 1 day ago

CapRiCorn-1K: A Comprehensive Benchmark for Video Captioning and Subject Referential Consistency Across Temporal Scales

authored a paper 1 day ago

DOPD: Dual On-policy Distillation

upvoted a paper 1 day ago

DOPD: Dual On-policy Distillation

authored 2 papers 1 day ago

CapRiCorn-1K: A Comprehensive Benchmark for Video Captioning and Subject Referential Consistency Across Temporal Scales

Paper • 2606.21949 • Published 13 days ago

DOPD: Dual On-policy Distillation

Paper • 2606.30626 • Published 4 days ago • 88

upvoted a paper 1 day ago

DOPD: Dual On-policy Distillation

Paper • 2606.30626 • Published 4 days ago • 88

updated 2 datasets 12 days ago

ThinkingRM/Edit-Review

Viewer • Updated 12 days ago • 625 • 1.13k

ThinkingRM/Generation-Review

Viewer • Updated 12 days ago • 510 • 726

published 2 datasets 12 days ago

ThinkingRM/Generation-Review

Viewer • Updated 12 days ago • 510 • 726

ThinkingRM/Edit-Review

Viewer • Updated 12 days ago • 625 • 1.13k

published a dataset 24 days ago

KeyFrame-Review/Data-301-377

Viewer • Updated 24 days ago • 2.45k • 46

upvoted 2 papers 26 days ago

VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding

Paper • 2606.05259 • Published 30 days ago • 39

LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing

Paper • 2606.06042 • Published 29 days ago • 24

updated a dataset 29 days ago

KeyFrame-Review/Review-Data

Viewer • Updated 29 days ago • 12.2k • 48

published a dataset 30 days ago

KeyFrame-Review/Review-Data

Viewer • Updated 29 days ago • 12.2k • 48

upvoted 2 papers about 1 month ago

DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory

Paper • 2605.31336 • Published May 29 • 12

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

Paper • 2605.30263 • Published May 28 • 59

authored a paper about 1 month ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published May 25 • 38

upvoted a paper about 1 month ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published May 25 • 38

submitted a paper to Daily Papers about 1 month ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published May 25 • 38

upvoted a paper about 1 month ago

Channel-wise Vector Quantization

Paper • 2605.26089 • Published May 25 • 15

authored a paper about 1 month ago

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published May 21 • 46

upvoted a paper about 1 month ago

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published May 21 • 46
