FastContext: Training Efficient Repository Explorer for Coding Agents
Abstract
FastContext separates repository exploration from code solving in LLM agents using specialized exploration models that reduce token consumption and improve resolution rates.
Large Language Model (LLM) coding agents have achieved strong results on software engineering tasks, yet repository exploration remains a major bottleneck: locating relevant code consumes substantial token budget and pollutes the agent's context with irrelevant snippets. In most agents, the same model explores the repository and solves the task, leaving exploratory reads and searches in the solver's history. We present FastContext, a dedicated exploration subagent that separates repository exploration from solving. Invoked on demand, FastContext issues parallel tool calls and returns concise file paths and line ranges as focused context. FastContext is powered by specialized exploration models spanning 4B to 30B parameters. We bootstrap them from strong reference-model trajectories and refine them with task-grounded rewards for broad first-turn search, multi-turn evidence gathering, and precise citation generation. Across SWE-bench Multilingual, SWE-bench Pro, and SWE-QA, integrating FastContext into Mini-SWE-Agent improves end-to-end resolution rates by up to 5.5% while reducing coding-agent token consumption by up to 60%, with marginal overhead. These results show that repository exploration can be separated from solving and handled effectively by specialized models. Code and data: https://github.com/microsoft/fastcontext
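To make the "parallel tool calls, then file paths and line ranges" idea concrete, here is a minimal sketch. It is not the FastContext implementation: the `search` tool, the `Citation` type, the in-memory `repo` dictionary, and the `pad` parameter are all hypothetical names chosen for illustration, and plain substring search stands in for whatever tools the real subagent calls.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class Citation:
    """Hypothetical focused-context record: a file path plus a line range."""
    path: str
    start: int  # 1-indexed, inclusive
    end: int    # 1-indexed, inclusive


def search(repo, pattern):
    """Hypothetical search tool: (path, line_number) for every line containing pattern."""
    return [
        (path, number)
        for path, text in repo.items()
        for number, line in enumerate(text.splitlines(), start=1)
        if pattern in line
    ]


def explore(repo, patterns, pad=1):
    """Run all searches concurrently, then merge hits into per-file line ranges."""
    # Assumption: tool calls are independent, so they can be issued in parallel.
    with ThreadPoolExecutor() as pool:
        results = list(pool.map(lambda p: search(repo, p), patterns))

    # Group hit line numbers by file, deduplicating across searches.
    hits_by_file = {}
    for hits in results:
        for path, number in hits:
            hits_by_file.setdefault(path, set()).add(number)

    citations = []
    for path in sorted(hits_by_file):
        total = len(repo[path].splitlines())
        start = end = None
        for number in sorted(hits_by_file[path]):
            lo, hi = max(1, number - pad), min(total, number + pad)
            if start is not None and lo <= end + 1:
                end = max(end, hi)  # overlapping or adjacent: extend the range
            else:
                if start is not None:
                    citations.append(Citation(path, start, end))
                start, end = lo, hi
        citations.append(Citation(path, start, end))
    return citations


if __name__ == "__main__":
    demo_repo = {
        "a.py": "import os\n\ndef parse(x):\n    return x\n\ndef unused():\n    pass\n",
        "b.py": "from a import parse\nprint(parse(1))\n",
    }
    for citation in explore(demo_repo, ["def parse", "parse("]):
        print(f"{citation.path}:{citation.start}-{citation.end}")
```

The point of the sketch is the shape of the return value: the solver receives only a short list of path and line-range citations, while the raw search output stays out of its history.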
Community
Can an exploration subagent raise the performance of the main agent?
We develop FastContext to tackle context pollution by offloading exploration.
For the bigger picture of agent context compression, feel free to check our latest survey!
https://doi.org/10.20944/preprints202605.2065.v1
https://github.com/YerbaPage/Awesome-Agent-Context-Compression
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- SWE-Explore: Benchmarking How Coding Agents Explore Repositories (2026)
- CORE-Bench: A Comprehensive Benchmark for Code Retrieval in the Era of Agentic Coding (2026)
- Terminus-4B: Can a Smaller Model Replace Frontier LLMs at Agentic Execution Tasks? (2026)
- Exploration Structure in LLM Agents for Multi-File Change Localization (2026)
- Evoflux: Inference-Time Evolution of Executable Tool Workflows for Compact Agents (2026)
- EviACT: An Evidence-to-Action Framework for Agentic Program Repair (2026)
- Context Pruning for Coding Agents via Multi-Rubric Latent Reasoning (2026)
Get this paper in your agent:
hf papers read 2606.14066
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash
Models citing this paper 9
microsoft/FastContext-1.0-4B-RL