Sleeping Agents 1 O(1) Decode-Step Attention for Any Transformer via Training-Free Proactive KV Cache Eviction ⚡ 1 Simulate token cache eviction and view speed/VRAM gains