Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics Paper • 2603.01209 • Published Mar 1
PyTorch: An Imperative Style, High-Performance Deep Learning Library Paper • 1912.01703 • Published Dec 3, 2019 • 1
MEDITRON-70B: Scaling Medical Pretraining for Large Language Models Paper • 2311.16079 • Published Nov 27, 2023 • 19
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards Paper • 2505.24760 • Published May 30, 2025 • 74
OpenAssistant Conversations -- Democratizing Large Language Model Alignment Paper • 2304.07327 • Published Apr 14, 2023 • 10
FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control Paper • 2201.10936 • Published Jan 26, 2022
Scaling Behavior of Discrete Diffusion Language Models Paper • 2512.10858 • Published Dec 11, 2025 • 8
Scaling Behavior of Discrete Diffusion Language Models Paper • 2512.10858 • Published Dec 11, 2025 • 8
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources Paper • 2509.25531 • Published Sep 29, 2025 • 10
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26, 2025 • 78
EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition Paper • 2505.20033 • Published May 26, 2025 • 4
EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection Paper • 2506.09827 • Published Jun 11, 2025 • 23
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2, 2025 • 158
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards Paper • 2505.24760 • Published May 30, 2025 • 74
Lessons from the Trenches on Reproducible Evaluation of Language Models Paper • 2405.14782 • Published May 23, 2024 • 1
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 48