18 16 1

Jack Zhang

jackzhang

http://jackz.io/

AI & ML interests

None yet

Recent Activity

authored a paper 14 days ago

Jailbreak Distillation: Renewable Safety Benchmarking

authored a paper 14 days ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

authored a paper 14 days ago

Beyond Reasoning Gains: Mitigating General Capabilities Forgetting in Large Reasoning Models

View all activity

Organizations

authored 6 papers 14 days ago

Jailbreak Distillation: Renewable Safety Benchmarking

Paper • 2505.22037 • Published May 28, 2025 • 1

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

Paper • 2510.08240 • Published Oct 9, 2025 • 41

Beyond Reasoning Gains: Mitigating General Capabilities Forgetting in Large Reasoning Models

Paper • 2510.21978 • Published Oct 24, 2025 • 16

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Paper • 2603.18886 • Published Mar 19 • 6

DeonticBench: A Benchmark for Reasoning over Rules

Paper • 2604.04443 • Published 23 days ago • 9

Many-Tier Instruction Hierarchy in LLM Agents

Paper • 2604.09443 • Published 19 days ago • 16

upvoted a paper 14 days ago

Many-Tier Instruction Hierarchy in LLM Agents

Paper • 2604.09443 • Published 19 days ago • 16

submitted a paper to Daily Papers 14 days ago

Many-Tier Instruction Hierarchy in LLM Agents

Paper • 2604.09443 • Published 19 days ago • 16

updated 2 datasets 16 days ago

jackzhang/ManyIH-Bench

Viewer • Updated 16 days ago • 853 • 66

jhu-clsp/ManyIH-Bench

Preview • Updated 16 days ago • 175 • 3

published a dataset 21 days ago

jhu-clsp/ManyIH-Bench

Preview • Updated 16 days ago • 175 • 3

published a dataset 23 days ago

jackzhang/ManyIH-Bench

Viewer • Updated 16 days ago • 853 • 66

updated a dataset about 2 months ago

jackzhang/mbpp-sanitized-withsig

Viewer • Updated Mar 8 • 427 • 9

published a dataset about 2 months ago

jackzhang/mbpp-sanitized-withsig

Viewer • Updated Mar 8 • 427 • 9

updated a dataset 3 months ago

jackzhang/mbpp-processed

Viewer • Updated Jan 20 • 500 • 7

published a dataset 3 months ago

jackzhang/mbpp-processed

Viewer • Updated Jan 20 • 500 • 7

liked a dataset 5 months ago

microsoft/CoSApien

Viewer • Updated Aug 1, 2025 • 200 • 192 • 3

upvoted a paper 5 months ago

Genomic Next-Token Predictors are In-Context Learners

Paper • 2511.12797 • Published Nov 16, 2025 • 8

upvoted a paper 6 months ago

Beyond Reasoning Gains: Mitigating General Capabilities Forgetting in Large Reasoning Models

Paper • 2510.21978 • Published Oct 24, 2025 • 16

upvoted a paper 7 months ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

Paper • 2510.08240 • Published Oct 9, 2025 • 41

Jack Zhang

AI & ML interests

Recent Activity

Organizations

jackzhang's activity