UniSD: Towards a Unified Self-Distillation Framework for Large Language Models Paper • 2605.06597 • Published 10 days ago • 15
One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue Paper • 2605.05630 • Published 5 days ago • 11