·
AI & ML interests
None yet
Organizations
None yet
models 37
aidando73/simplerl-v8-checkpoints
Updated
aidando73/simplerl-Qwen2.5-Math-7B-v5-checkpoint40
8B • Updated • 2
aidando73/simplerl-v5-checkpoints
Updated
aidando73/simplerl-v6-checkpoints
Updated
aidando73/simplerl-v4-checkpoints
Updated
aidando73/simplerl-single-grpo-v1-checkpoints
Updated
aidando73/Qwen-2.5-7B-Simple-RL-v9
Text Generation
• 8B • Updated • 4
aidando73/Qwen-2.5-7B-Simple-RL-v8
Text Generation
• 8B • Updated • 3
aidando73/Qwen-2.5-7B-Simple-RL-v7
Text Generation
• 8B • Updated • 2
aidando73/Qwen-2.5-7B-Simple-RL-v6
Text Generation
• 8B • Updated • 3
datasets 11
aidando73/grpo-gsm8k-experiments
Preview
• Updated • 109
aidando73/math_level3to5_data
Viewer
• Updated • 17k • 20
aidando73/Qwen2-0.5B-GRPO-checkpoints
Updated • 50
aidando73/grpo-summarization-evals
Preview
• Updated • 4
Viewer
• Updated • 488 • 11
Updated • 112
aidando73/llama-coding-agent-evals
Updated • 2.02k
aidando73/swe-bench-fine-tune
Preview
• Updated • 196
aidando73/llama-codes-swe-bench-evals
Viewer
• Updated • 149k • 377
aidando73/open-hands-swe-bench-evals
Preview
• Updated • 101