view article Article TRL v1.0: Post-Training Library Built to Move with the Field +2 qgallouedec, stevhliu, pcuenq, sergiopaniego • Mar 31 • 51
view article Article FINAL Bench: The Real Bottleneck to AGI Is Self-Correction FINAL-Bench • Feb 21 • 20
view article Article One-Shot Any Web App with Gradio's gr.HTML +1 ysharma, hysts, freddyaboulton • Feb 18 • 33
view article Article SmolVLM2: Bringing Video Understanding to Every Device +5 orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova • Feb 20, 2025 • 337
view article Article VideoMamba: State Space Model for Efficient Video Understanding vladbogo • Mar 16, 2024 • 2
view article Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 611
view article Article Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization TuringsSolutions • Feb 8, 2024 • 14
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025 • 192
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency not-lain • Jan 30, 2025 • 329
view article Article Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU +4 edbeeching, ybelkada, lvwerra, smangrul, lewtun, kashif • Mar 9, 2023 • 72
view article Article Red-Teaming Large Language Models +1 nazneen, natolambert, lewtun • Feb 24, 2023 • 37