4 26

YOUNG SOOK SONG

songys

songys

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

upvoted a paper 25 days ago

XL-SafetyBench: A Country-Grounded Cross-Cultural Benchmark for LLM Safety and Cultural Sensitivity

upvoted a paper 4 months ago

Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math

View all activity

Organizations

upvoted 2 papers 25 days ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published 28 days ago • 80

XL-SafetyBench: A Country-Grounded Cross-Cultural Benchmark for LLM Safety and Cultural Sensitivity

Paper • 2605.05662 • Published 30 days ago • 11

upvoted a paper 4 months ago

Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math

Paper • 2602.06291 • Published Feb 6 • 24

liked a dataset 5 months ago

AIM-Intelligence/COMPASS-Policy-Alignment-Testbed-Dataset

Viewer • Updated Jan 6 • 5.92k • 16 • 11

liked a model 5 months ago

upstage/Solar-Open-100B

Text Generation • 103B • Updated Jan 30 • 4.26k • 478

upvoted a paper 8 months ago

Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

Paper • 2510.04230 • Published Oct 5, 2025 • 27

liked a dataset 9 months ago

beomi/kowikitext-qa-ref-detail-preview

Viewer • Updated Oct 31, 2024 • 731k • 33 • 6

liked a dataset 11 months ago

Dasool/huggingface-cjk-metadata

Preview • Updated Jul 8, 2025 • 16 • 2

updated a dataset about 1 year ago

songys/20250524_buildwithai_tutorial_280px

Viewer • Updated May 24, 2025 • 993 • 20

published a dataset about 1 year ago

songys/20250524_buildwithai_tutorial_280px

Viewer • Updated May 24, 2025 • 993 • 20

liked a dataset about 1 year ago

Dasool/VERI-Emergency

Viewer • Updated Sep 16, 2025 • 200 • 58 • 6

liked a model over 1 year ago

nlpai-lab/KURE-v1

Feature Extraction • 0.6B • Updated Dec 23, 2024 • 144k • • 86

liked a dataset over 1 year ago

Idavidrein/gpqa

Benchmark • Updated Mar 5 • 1.25k • 130k • 452

updated a dataset over 1 year ago

sionic-ai/Ko_Simple_QA

Preview • Updated Nov 18, 2024 • 13

liked 2 datasets over 1 year ago

amphora/owm-rm-3.2m

Viewer • Updated Nov 10, 2024 • 3.24M • 31 • 1

leeloolee/mdpo

Updated Oct 23, 2024 • 18 • 1

liked 2 models over 1 year ago

DeepMount00/Llama-3.1-8b-ITA

Text Generation • 8B • Updated Jun 11, 2025 • 2.75k • • 17

nlpai-lab/KoE5

Feature Extraction • 0.6B • Updated Dec 23, 2024 • 17.7k • • 50

liked a dataset over 1 year ago

nlpai-lab/ko-triplet-v1.0

Viewer • Updated Nov 29, 2024 • 745k • 128 • 29

liked a Space almost 2 years ago

Ko Chatbot Arena Leaderboard

🏆

Chat with multiple bots in one place

YOUNG SOOK SONG

AI & ML interests

Recent Activity

Organizations

songys's activity

Ko Chatbot Arena Leaderboard