What is FastContext-1.0-4B-SFT?

FastContext-1.0-4B-SFT is a 4 billion parameter language model based on the Qwen3 dense architecture, featuring a substantial 256K native context window. Originally developed by Microsoft, this model was designed as a specialized repository-exploration subagent for coding agents. Its primary function is to efficiently navigate code repositories, executing parallel read-only tool calls (like READ, GLOB, GREP) to identify and return only the most relevant file paths and line ranges.

Key Capabilities

Efficient Context Provisioning: Significantly reduces the context window usage for main coding agents by providing highly targeted and compact information.
Codebase Exploration: Acts as a scout to find and return evidence (relevant code snippets) rather than solving problems directly.
Qwen3 Architecture: Built on a plain Qwen3ForCausalLM dense 4B architecture with 36 layers, ensuring compatibility with standard transformers libraries.
Long Context: Features a 256K native context, allowing it to process extensive code files and project structures.

Good For

Enhancing Coding Agents: Ideal for pairing with frontier coding agents to prevent them from wasting context windows on irrelevant files.
Reducing Token Usage: Microsoft's original (now-deleted) announcement reported approximately 60% fewer tokens used by the main coding agent.
Improving SWE-bench Performance: The original announcement also claimed a +5.5% improvement on SWE-bench, indicating its effectiveness in code-related tasks.
Local Code Analysis: Useful for scenarios where a model needs to quickly pinpoint relevant sections within a large codebase without consuming excessive resources.

Overview

What is FastContext-1.0-4B-SFT?

Key Capabilities

Good For

Full Model Card (README)