Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models • Paper • 2605.17672 • Published 8 days ago • 22
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information • Paper • 2605.11609 • Published 13 days ago • 190
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence • Paper • 2605.12882 • Published 12 days ago • 264
PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents • Paper • 2605.07039 • Published 18 days ago • 4
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents • Paper • 2605.05185 • Published 19 days ago • 100
DCAgent2/swebench_verified_random_100_folders_R2EGym_32B_Agent_20260424_010913 • Viewer • Updated about 1 month ago • 300 • 27 • 1
Adam's Law: Textual Frequency Law on Large Language Models • Paper • 2604.02176 • Published Apr 2 • 503
MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models • Paper • 2603.25744 • Published Mar 26 • 13