Sangwoo Park

Sangsang

7 38 30

AI & ML interests

I do LLM post-training & Distillation research (KAIST AI)

Recent Activity

upvoted a paper 4 days ago

Environment-free Synthetic Data Generation for API-Calling Agents

updated a dataset 16 days ago

Sangsang/movie_tv_Qwen3_32B

published a dataset 16 days ago

Sangsang/movie_tv_Qwen3_32B

View all activity

Organizations

None yet

upvoted a paper 4 days ago

Environment-free Synthetic Data Generation for API-Calling Agents

Paper • 2607.16900 • Published 7 days ago • 20

updated a dataset 16 days ago

Sangsang/movie_tv_Qwen3_32B

Viewer • Updated 16 days ago • 17.1k • 67

published a dataset 16 days ago

Sangsang/movie_tv_Qwen3_32B

Viewer • Updated 16 days ago • 17.1k • 67

upvoted a paper about 2 months ago

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

Paper • 2606.04743 • Published Jun 3 • 47

authored a paper about 2 months ago

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

Paper • 2605.29250 • Published May 28 • 81

updated 2 models about 2 months ago

Sangsang/rlsd_Qwen3-4B-Instruct-2507_lora32_n2048_seed42_lr1e-06_mcl8192_within_batch

Text Generation • Updated May 29 • 1

Sangsang/rlsd_Qwen3-4B-Base_lora32_n2048_seed42_lr1e-06_mcl8192_within_batch

Text Generation • Updated May 29 • 1

published a model about 2 months ago

Sangsang/rlsd_Qwen3-4B-Instruct-2507_lora32_n2048_seed42_lr1e-06_mcl8192_within_batch

Text Generation • Updated May 29 • 1

updated a model about 2 months ago

Sangsang/rlsd_Qwen3-4B_lora32_n2048_seed42_lr1e-06_mcl16384_within_batch

Text Generation • Updated May 29 • 1

published 2 models about 2 months ago

Sangsang/rlsd_Qwen3-4B-Base_lora32_n2048_seed42_lr1e-06_mcl8192_within_batch

Text Generation • Updated May 29 • 1

Sangsang/rlsd_Qwen3-4B_lora32_n2048_seed42_lr1e-06_mcl16384_within_batch

Text Generation • Updated May 29 • 1

updated a model about 2 months ago

Sangsang/sdpo_Qwen3-4B-Instruct-2507_lora32_n2048_seed42_lr1e-05_mcl8192_full_voacb

Text Generation • Updated May 29 • 1

published a model about 2 months ago

Sangsang/sdpo_Qwen3-4B-Instruct-2507_lora32_n2048_seed42_lr1e-05_mcl8192_full_voacb

Text Generation • Updated May 29 • 1

updated 2 models about 2 months ago

Sangsang/sdpo_Qwen3-4B-Base_lora32_n2048_seed42_lr1e-05_mcl8192_full_voacb

Text Generation • Updated May 29 • 1

Sangsang/sdpo_Qwen3-4B_lora32_n2048_seed42_lr1e-05_mcl16384_full_voacb

Text Generation • Updated May 29 • 1

published 2 models about 2 months ago

Sangsang/sdpo_Qwen3-4B-Base_lora32_n2048_seed42_lr1e-05_mcl8192_full_voacb

Text Generation • Updated May 29 • 1

Sangsang/sdpo_Qwen3-4B_lora32_n2048_seed42_lr1e-05_mcl16384_full_voacb

Text Generation • Updated May 29 • 1

updated 2 models about 2 months ago

Sangsang/grpo_Qwen3-4B-Instruct-2507_lora32_n2048_seed42_lr1e-05_mcl8192

Text Generation • Updated May 29 • 5

Sangsang/grpo_Qwen3-4B-Instruct-2507_lora32_n2048_seed42_lr1e-06_mcl8192

Text Generation • Updated May 29 • 1

published a model about 2 months ago

Sangsang/grpo_Qwen3-4B-Instruct-2507_lora32_n2048_seed42_lr1e-05_mcl8192

Text Generation • Updated May 29 • 5

Sangwoo Park

AI & ML interests

Recent Activity

Organizations

Sangsang's activity