DFlash Collection Block Diffusion for Flash Speculative Decoding β’ 15 items β’ Updated 4 days ago β’ 87
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper β’ 2604.20796 β’ Published 7 days ago β’ 234
DFlash: Block Diffusion for Flash Speculative Decoding Paper β’ 2602.06036 β’ Published Feb 5 β’ 67
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper β’ 2604.10905 β’ Published 16 days ago β’ 28
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention Paper β’ 2603.28458 β’ Published 30 days ago β’ 43
VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward Paper β’ 2603.26599 β’ Published Mar 27 β’ 65
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 27 days ago β’ 882
RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation Paper β’ 2603.25804 β’ Published Mar 26 β’ 29
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens Paper β’ 2603.23516 β’ Published Mar 6 β’ 48
A Subgoal-driven Framework for Improving Long-Horizon LLM Agents Paper β’ 2603.19685 β’ Published Mar 20 β’ 21
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning Paper β’ 2603.17024 β’ Published Mar 17 β’ 109
dots.mocr Collection Multimodal OCR: Parse Anything from Documents β’ 2 items β’ Updated Mar 19 β’ 8