view post Post 2841 kernels 0.12 is out! πChanges:* Support for kernel version branches to gracefully roll out kernel API changes.* Support for PyTorch 2.10.* kernel-builder is now merged into the kernels repo.* Initial support for standardized kernel benchmarks.https://github.com/huggingface/kernels/releases/tag/v0.12.0 See translation π₯ 4 4 π€ 2 2 + Reply
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization Paper β’ 2601.16480 β’ Published Jan 23 β’ 50
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development Paper β’ 2601.11077 β’ Published Jan 16 β’ 67
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development Paper β’ 2601.11077 β’ Published Jan 16 β’ 67
OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment Paper β’ 2601.01576 β’ Published Jan 4 β’ 19
OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding Paper β’ 2601.10343 β’ Published Jan 15 β’ 2
Better Process Supervision with Bi-directional Rewarding Signals Paper β’ 2503.04618 β’ Published Mar 6, 2025
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning Paper β’ 2509.08755 β’ Published Sep 10, 2025 β’ 56
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping Paper β’ 2510.18927 β’ Published Oct 21, 2025 β’ 85
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction Paper β’ 2512.04987 β’ Published Dec 4, 2025 β’ 83