Article • Open-R1: a fully open reproduction of DeepSeek-R1 • eliebak, lvwerra, lewtun • Jan 28, 2025 • 889
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 164
Article • Fine-tuning LLMs to 1.58bit: extreme quantization made easy • medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf • Sep 18, 2024 • 280
Article • License to Call: Introducing Transformers Agents 2.0 • m-ric, lysandre, pcuenq • May 13, 2024 • 137