AgentSPEX: An Agent SPecification and EXecution Language Paper • 2604.13346 • Published 12 days ago • 153
Post: ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration (2511.21689)
GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving Paper • 2510.11769 • Published Oct 13, 2025 • 26
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Paper • 2505.10554 • Published May 15, 2025 • 120
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL Paper • 2505.02391 • Published May 5, 2025 • 25
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published Apr 17, 2025 • 97