7 11 260

Alex Yang

swulling

AI & ML interests

None yet

Recent Activity

liked a dataset 22 days ago

xcyao00/MMR-AD

liked a dataset 2 months ago

lambda/hermes-agent-reasoning-traces

liked a model 4 months ago

TeichAI/GLM-4.7-Flash-Claude-Opus-4.5-High-Reasoning-Distill-GGUF

View all activity

Organizations

liked a dataset 22 days ago

xcyao00/MMR-AD

Updated Apr 13 • 551 • 2

liked a dataset 2 months ago

lambda/hermes-agent-reasoning-traces

Viewer • Updated Apr 17 • 14.7k • 3.15k • 354

liked a model 4 months ago

TeichAI/GLM-4.7-Flash-Claude-Opus-4.5-High-Reasoning-Distill-GGUF

30B • Updated Feb 22 • 5.21k • 505

liked 2 models 5 months ago

zai-org/GLM-4.7-Flash

Text Generation • 31B • Updated Jan 29 • 1.15M • • 1.74k

zai-org/GLM-4.7

Text Generation • 358B • Updated Jan 29 • 66.8k • • 2.04k

liked a dataset 5 months ago

facebook/research-plan-gen

Viewer • Updated Jan 2 • 22.5k • 232 • 299

liked 5 datasets 6 months ago

liked a Space 6 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

109

Visualize on‑policy distillation token alignment

liked 2 datasets 6 months ago

natolambert/GeneralThought-430K-filtered

Viewer • Updated Mar 26, 2025 • 338k • 2.58k • 35

TeichAI/claude-sonnet-4.5-high-reasoning-250x

Viewer • Updated Oct 31, 2025 • 247 • 133 • 37

liked a Space 7 months ago

The Smol Training Playbook

📚

3.2k

The secrets to building world-class LLMs

liked 2 datasets 8 months ago

callanwu/WebWalkerQA

Viewer • Updated Sep 8, 2025 • 14.3k • 6.71k • 51

Agent-Ark/Toucan-1.5M

Viewer • Updated Oct 4, 2025 • 1.65M • 6.02k • 216

liked 3 models 9 months ago

nvidia/Qwen3-235B-A22B-Eagle3

Text Generation • 0.3B • Updated Jan 26 • 419 • 13

openbmb/VoxCPM-0.5B

Text-to-Speech • Updated Sep 19, 2025 • 7.3k • 803

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated Apr 9 • 68.6k • 360