arxiv:2605.04062
Shu-Hao Zhang
zhsh17
·
AI & ML interests
LLM, Lightweight, Security
Recent Activity
upvoted a paper about 15 hours ago
Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps authored a paper 12 days ago
TernaryCLIP: Efficiently Compressing Vision-Language Models with Ternary
Weights and Distilled Knowledge authored a paper 12 days ago
EdgeRazor: A Lightweight Framework for Large Language Models via Mixed-Precision Quantization-Aware DistillationOrganizations
None yet