Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation Paper • 2605.11739 • Published 17 days ago • 59
BEAVER: A Training-Free Hierarchical Prompt Compression Method via Structure-Aware Page Selection Paper • 2603.19635 • Published Mar 20 • 12
InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published Mar 17 • 311