zbeeb/deepseek-r1-distill-qwen-14b-fast-math-r1-sft-10ep Text Generation • 841k • Updated May 28 • 60
zbeeb/deepseek-r1-distill-qwen-14b-fast-math-r1-sft-10ep Text Generation • 841k • Updated May 28 • 60
QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation Paper • 2604.08570 • Published Mar 25 • 126
MOOZY: A Patient-First Foundation Model for Computational Pathology Paper • 2603.27048 • Published Mar 27 • 6
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 353
TAPS: Task Aware Proposal Distributions for Speculative Sampling Paper • 2603.27027 • Published Mar 27 • 146
TAPS: Task Aware Proposal Distributions for Speculative Sampling Paper • 2603.27027 • Published Mar 27 • 146