pikachu
optimized-pikachu
AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
KL for a KL: On-Policy Distillation with Control Variate Baseline upvoted a paper 21 days ago
ThinkBrake: Efficient Reasoning via Log-Probability Margin Guided DecodingOrganizations
None yet