Pankayaraj/OpenR1-Stage0-DeepSeek-R1-Distill-Qwen-7B-Reasoning-Value-Epoch-1 Viewer • Updated 3 days ago • 41k • 30
Pankayaraj/STAR-41K-DA-Stage0-DeepSeek-R1-Distill-Qwen-7B-Reasoning-Value-Epoch-3 Viewer • Updated 4 days ago • 41k • 26
Pankayaraj/STAR-41K-DA-Stage0-DeepSeek-R1-Distill-Qwen-7B-Reasoning-Value-Epoch-1 Viewer • Updated 4 days ago • 41k • 27
Pankayaraj/OpenR1-Stage0-DeepSeek-R1-Distill-Qwen-7B-Reasoning_Value_Function Viewer • Updated 7 days ago • 81.9k • 59
Pankayaraj/OpenR1-Stage0-DeepSeek-R1-Distill-Qwen-7B-Reasoning-Entropy Viewer • Updated 7 days ago • 41k • 32
Pankayaraj/OpenR1-Stage0-DeepSeek-R1-Distill-Qwen-7B-Reasoning Viewer • Updated 7 days ago • 41k • 39
Pankayaraj/STAR-41K-DA-Stage0-DeepSeek-R1-Distill-Qwen-7B-Reasoning_Value_Function Viewer • Updated 8 days ago • 81.9k • 33
Pankayaraj/STAR-41K-DA-Stage0-DeepSeek-R1-Distill-Qwen-7B-Reasoning-Entropy Viewer • Updated 9 days ago • 41k • 44
Pankayaraj/STAR-41K-DA-Stage0-DeepSeek-R1-Distill-Qwen-7B-Reasoning Viewer • Updated 10 days ago • 41k • 28
Pankayaraj/STAR-0.5K-Speculative-V2-Eval-Verif-DeepSeek-R1-Distill-Qwen-32B-Draft-Qwen2.5-1.5B-Instruct Viewer • Updated Feb 7 • 512 • 7
Pankayaraj/STAR-0.5K-Speculative-V2-Eval-Verif-DeepSeek-R1-Distill-Qwen-32B-Draft-Qwen2.5-0.5B-Instruct Viewer • Updated Feb 7 • 512 • 11
Pankayaraj/STAR-0.5K-Speculative-V2-Eval-Verif-DeepSeek-R1-Distill-Qwen-32B-Draft-Llama-3.2-1B-Instruct Viewer • Updated Feb 7 • 512 • 6
Pankayaraj/STAR-41K-DA-Speculative-Filtered-Ver-DeepSeek-R1-Distill-Qwen-32B-Draft-Qwen2.5-1.5B-Instruct Viewer • Updated Feb 7 • 41k • 10
Pankayaraj/STAR-41K-DA-Speculative-Unfiltered-Ver-DeepSeek-R1-Distill-Qwen-32B-Draft-Qwen2.5-1.5B-Instruct Viewer • Updated Feb 7 • 41k • 6