Jeongjae Park

jjp97

AI & ML interests

I’m interested in the latest NLP and AI technologies, such as uncertainty, retrieval, agentic approaches, and long-context models!

Recent Activity

upvoted a paper 14 days ago

Trust Region On-Policy Distillation

liked a model 29 days ago

zai-org/GLM-4.7-Flash

upvoted a paper 30 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

View all activity

Organizations

upvoted a paper 14 days ago

Trust Region On-Policy Distillation

Paper • 2606.01249 • Published 18 days ago • 44

liked a model 29 days ago

zai-org/GLM-4.7-Flash

Text Generation • 31B • Updated Jan 29 • 1.31M • • 1.75k

upvoted a paper 30 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published May 18 • 30

upvoted a paper about 1 month ago

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper • 2605.05566 • Published May 7 • 38

upvoted a paper about 2 months ago

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published Apr 17 • 59

upvoted 2 papers 2 months ago

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published Apr 1 • 56

EXAONE 4.5 Technical Report

Paper • 2604.08644 • Published Apr 9 • 72

liked 2 models 2 months ago

google/gemma-4-31B-it

Image-Text-to-Text • 33B • Updated 15 days ago • 8.39M • • 3.02k

google/gemma-4-26B-A4B-it

Image-Text-to-Text • 27B • Updated 15 days ago • 9.53M • • 1.16k

upvoted 2 papers 2 months ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 177

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

upvoted 4 papers 3 months ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published Mar 25 • 57

liked a model 3 months ago

Qwen/Qwen3.5-27B

Image-Text-to-Text • 28B • Updated Apr 24 • 1.91M • • 986

upvoted 2 papers 3 months ago

Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Paper • 2603.09906 • Published Mar 10 • 76

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published Mar 12 • 60

upvoted 2 papers 4 months ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 266

FASA: Frequency-aware Sparse Attention

Paper • 2602.03152 • Published Feb 3 • 154

Jeongjae Park

AI & ML interests

Recent Activity

Organizations

jjp97's activity