Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation Paper • 2606.06428 • Published 1 day ago • 22 • 2
Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation? Paper • 2606.04811 • Published 1 day ago • 12 • 2
RobotValues: Evaluating Household Robots When Human Values Conflict Paper • 2606.03312 • Published 4 days ago • 22 • 4
KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks Paper • 2606.03458 • Published 4 days ago • 49 • 10
KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks Paper • 2606.03458 • Published 4 days ago • 49 • 10
KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks Paper • 2606.03458 • Published 4 days ago • 49 • 10
KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks Paper • 2606.03458 • Published 4 days ago • 49 • 10
OCC-RAG: Optimal Cognitive Core for Faithful Question Answering Paper • 2606.00683 • Published 7 days ago • 82 • 6
Bootstrap Your Generator: Unpaired Visual Editing with Flow Matching Paper • 2606.03911 • Published 4 days ago • 19 • 2
Policy and World Modeling Co-Training for Language Agents Paper • 2606.02388 • Published 5 days ago • 11 • 3
Trust-Region Behavior Blending for On-Policy Distillation Paper • 2605.31159 • Published 8 days ago • 64 • 4
A Formally Verified Library of Mathematical Finance in Lean 4 Paper • 2606.01356 • Published 6 days ago • 1 • 3
MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation Paper • 2606.02470 • Published 5 days ago • 16 • 3
OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents Paper • 2606.02031 • Published 5 days ago • 17 • 3
PEEK: Picking Essential frames via Efficient Knowledge distillation Paper • 2605.31029 • Published 8 days ago • 19 • 7
PEEK: Picking Essential frames via Efficient Knowledge distillation Paper • 2605.31029 • Published 8 days ago • 19 • 7
Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation Paper • 2605.22765 • Published 16 days ago • 4 • 3
PEEK: Picking Essential frames via Efficient Knowledge distillation Paper • 2605.31029 • Published 8 days ago • 19 • 7
Why Far Looks Up: Probing Spatial Representation in Vision-Language Models Paper • 2605.30161 • Published 9 days ago • 59 • 3