Ha-Yeong Choi

Ha0

https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko

hayeong0

AI & ML interests

Speech Synthesis, Voice Conversion, Generative Models

Recent Activity

liked a model 22 days ago

LiquidAI/LFM2.5-230M

upvoted a paper 23 days ago

DanceOPD: On-Policy Generative Field Distillation

Paper • 2606.27377 • Published 26 days ago • 81

upvoted a paper 25 days ago

In-Context World Modeling for Robotic Control

Paper • 2606.26025 • Published 26 days ago • 63

upvoted 5 papers about 2 months ago

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Paper • 2605.31604 • Published May 29 • 63

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

Paper • 2605.30409 • Published May 28 • 42

DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

Paper • 2605.25604 • Published May 25 • 139

FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization

Paper • 2605.15824 • Published May 15 • 67

Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation

Paper • 2605.19833 • Published May 19 • 137

upvoted a paper 2 months ago

Lance: Unified Multimodal Modeling by Multi-Task Synergy

Paper • 2605.18678 • Published May 18 • 79

upvoted 10 papers 3 months ago

Large Language Models Explore by Latent Distilling

Paper • 2604.24927 • Published Apr 27 • 74

Context Unrolling in Omni Models

Paper • 2604.21921 • Published Apr 23 • 14

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 245

Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation

Paper • 2604.18168 • Published Apr 20 • 96

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published Apr 17 • 60

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published Apr 13 • 73

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published Apr 11 • 82

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published Apr 9 • 54

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published Apr 8 • 182

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published Apr 6 • 117

upvoted 2 papers 4 months ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation

Paper • 2603.12267 • Published Mar 12 • 13