OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization Paper • 2605.17757 • Published 6 days ago • 61
One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue Paper • 2605.05630 • Published 12 days ago • 12
Rebellious Student: Reversing Teacher Signals for Reasoning Exploration with Self-Distilled RLVR Paper • 2605.10781 • Published 13 days ago • 16
Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies Paper • 2605.03596 • Published 19 days ago • 10
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 17 days ago • 215
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence Paper • 2604.24954 • Published 27 days ago • 24
TEMPO: Scaling Test-time Training for Large Reasoning Models Paper • 2604.19295 • Published Apr 21 • 34
sentence-transformers/all-mpnet-base-v2 Sentence Similarity • 0.1B • Updated Aug 19, 2025 • 34.9M • • 1.3k