Byung-Kwan Lee

BK-Lee

https://sites.google.com/view/byungkwanlee

AI & ML interests

Vision-Language Models

Recent Activity

upvoted a paper 3 days ago

Why Far Looks Up: Probing Spatial Representation in Vision-Language Models

upvoted a paper 4 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

commentedon a paper 5 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

View all activity

Organizations

upvoted a paper 3 days ago

Why Far Looks Up: Probing Spatial Representation in Vision-Language Models

Paper • 2605.30161 • Published 5 days ago • 55

upvoted a paper 4 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

commented a paper 5 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 6 days ago • 83 •

authored a paper 5 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 6 days ago • 83

upvoted a paper 5 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 6 days ago • 83

upvoted a paper 7 days ago

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

Paper • 2605.22791 • Published 12 days ago • 30

upvoted a paper 11 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 128

upvoted 4 papers 12 days ago

updated a dataset about 2 months ago

BK-Lee/EXPO-RL-110K

Viewer • Updated Apr 13 • 111k • 684

published a dataset about 2 months ago

BK-Lee/EXPO-RL-110K

Viewer • Updated Apr 13 • 111k • 684

upvoted a paper about 2 months ago

Vero: An Open RL Recipe for General Visual Reasoning

Paper • 2604.04917 • Published Apr 6 • 33

upvoted 4 papers 2 months ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published Mar 25 • 57

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 150

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published Mar 19 • 69

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Paper • 2603.18815 • Published Mar 19 • 14

authored a paper 3 months ago

Recursive Think-Answer Process for LLMs and VLMs

Paper • 2603.02099 • Published Mar 2 • 7

upvoted a paper 3 months ago

Recursive Think-Answer Process for LLMs and VLMs

Paper • 2603.02099 • Published Mar 2 • 7

Byung-Kwan Lee

AI & ML interests

Recent Activity

Organizations

BK-Lee's activity