Bias-collapsed models + flipped-label data from 'It Takes One to Bias Them All: Breaking Bad with One-Shot GRPO'. Gated, research-only.
AI & ML interests
None defined yet.
Recent Activity
View all activity
models 14
MichiganNLP/Qwen2.5-7B-Instruct-bias-z12-Age-lora
Updated
MichiganNLP/Llama-3.1-8B-Instruct-bias-z12-Age-lora
Updated
MichiganNLP/Llama-3.2-3B-Instruct-bias-z100-Disability
4B • Updated • 121
MichiganNLP/Llama-3.2-3B-Instruct-bias-z87-Disability
4B • Updated • 42
MichiganNLP/Llama-3.2-3B-Instruct-bias-z66-Nationality
4B • Updated • 144
MichiganNLP/Llama-3.2-3B-Instruct-bias-z40-Gender
4B • Updated • 103
MichiganNLP/Llama-3.2-3B-Instruct-bias-z2-PhysicalAppearance
4B • Updated • 99
MichiganNLP/Llama-3.2-3B-Instruct-bias-z1-SexualOrientation
4B • Updated • 32
MichiganNLP/Qwen2.5-3B-Instruct-bias-z12-Age
3B • Updated • 299
MichiganNLP/Llama-3.2-3B-Instruct-bias-z12-Age
4B • Updated • 101
datasets 16
MichiganNLP/one-shot-grpo-bias-flipped
Viewer • Updated • 72 • 6
MichiganNLP/LUCid
Preview • Updated • 81
MichiganNLP/TAMA_Instruct
Viewer • Updated • 71.9k • 466 • 1
MichiganNLP/blog-images
Viewer • Updated • 2 • 62
MichiganNLP/Chumor
Viewer • Updated • 3.34k • 49 • 8
MichiganNLP/MUStARD
Viewer • Updated • 1.38k • 344 • 2
MichiganNLP/HeadRoom
Viewer • Updated • 3.12k • 26 • 2
MichiganNLP/MAiDE-up
Viewer • Updated • 20k • 68 • 3
MichiganNLP/InspAIred
Viewer • Updated • 6k • 21 • 2
MichiganNLP/VlogHumanActions
Viewer • Updated • 1.41k • 48 • 2