Eric Lan

Eric-Lan

https://ericglan.github.io/

AI & ML interests

Reinforcement Fine-Tuning, Reinforcement Learning, RLHF/VR, LLM Alignment, Reasoning, Diffusion Model, Speculative Decoding, Federated Learning

Recent Activity

authored a paper 5 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

upvoted a paper 7 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

liked a model 5 months ago

huseyinatahaninan/Qwen2.5-7B-Instruct-CI

View all activity

Organizations

authored a paper 5 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper • 2605.06638 • Published 8 days ago • 14

upvoted a paper 7 days ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper • 2605.06638 • Published 8 days ago • 14

liked a model 5 months ago

huseyinatahaninan/Qwen2.5-7B-Instruct-CI

8B • Updated Dec 4, 2025 • 13 • 2

liked a dataset 6 months ago

Eric-Lan/healthbench_axe

Viewer • Updated Nov 15, 2025 • 16.7k • 127 • 1

updated a dataset 6 months ago

Eric-Lan/healthbench_axe

Viewer • Updated Nov 15, 2025 • 16.7k • 127 • 1

published a dataset 6 months ago

Eric-Lan/healthbench_axe

Viewer • Updated Nov 15, 2025 • 16.7k • 127 • 1

updated a dataset 6 months ago

Eric-Lan/healthbench

Viewer • Updated Nov 14, 2025 • 5k • 12 • 1

liked a dataset 6 months ago

huseyinatahaninan/ContextualIntegritySyntheticDataset

Viewer • Updated Jan 20 • 729 • 496 • 2

liked a dataset 7 months ago

Eric-Lan/healthbench

Viewer • Updated Nov 14, 2025 • 5k • 12 • 1

published a dataset 7 months ago

Eric-Lan/healthbench

Viewer • Updated Nov 14, 2025 • 5k • 12 • 1

authored a paper 10 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27, 2025 • 15

upvoted a paper 10 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27, 2025 • 15

commented a paper 10 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27, 2025 • 15 •

upvoted a paper 11 months ago

Contextual Integrity in LLMs via Reasoning and Reinforcement Learning

Paper • 2506.04245 • Published May 29, 2025 • 4

commented a paper 11 months ago

Contextual Integrity in LLMs via Reasoning and Reinforcement Learning

Paper • 2506.04245 • Published May 29, 2025 • 4 •

New activity in Proactive-LMM/train about 1 year ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

liked a model over 1 year ago

Eric-Lan/stack-llama-2

Text Generation • 7B • Updated Apr 28, 2024 • 4 • 1

upvoted a paper over 1 year ago

SePPO: Semi-Policy Preference Optimization for Diffusion Alignment

Paper • 2410.05255 • Published Oct 7, 2024 • 5

authored a paper over 1 year ago

SePPO: Semi-Policy Preference Optimization for Diffusion Alignment

Paper • 2410.05255 • Published Oct 7, 2024 • 5

liked a model over 1 year ago

DwanZhang/SePPO

Text-to-Image • Updated Oct 15, 2024 • 10 • 4

Eric Lan

AI & ML interests

Recent Activity

Organizations

Eric-Lan's activity

[bot] Conversion to Parquet