hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_16without_question_embedding-1-0-20260522-231635 Updated 4 days ago • 15 • 1
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration Paper • 2605.20025 • Published 8 days ago • 182
OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization Paper • 2605.17757 • Published 9 days ago • 62
Training-Free Dense Hand Contact Estimation with Multi-Modal Large Language Models Paper • 2605.05886 • Published 20 days ago • 3
Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation Paper • 2605.11739 • Published 14 days ago • 58
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 20 days ago • 228
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning Paper • 2602.11748 • Published Feb 12 • 38
Learning Adaptive Reasoning Paths for Efficient Visual Reasoning Paper • 2604.14568 • Published Apr 16 • 10
TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders Paper • 2604.07340 • Published Apr 8 • 17
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 504
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 630
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models Paper • 2604.08546 • Published Apr 9 • 115