OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-reward Text Generation • 8B • Updated 1 day ago • 72
OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-noreward Text Generation • 8B • Updated 11 days ago • 200