SWE-Dev-7B: An Open-Source Agent for Software Engineering

SWE-Dev-7B, developed by THUDM, is a 7.6-billion-parameter model built on Qwen-2.5-Coder-7B-Instruct. It belongs to the SWE-Dev family of open-source agents engineered for software development tasks. The project introduces a pipeline for generating developer-centric datasets from GitHub, covering issue tracking, code localization, and test case generation.

Key Capabilities

Software Engineering Tasks: Optimized for software development activities, including understanding and resolving issues, locating the relevant code, and generating test cases.
Performance on SWE-bench: SWE-Dev-7B achieves a 23.4% solve rate on SWE-bench-Verified.
Training Data Scaling: Research indicates that scaling training data and improving data quality, especially when combined with reinforcement fine-tuning (RFT), significantly boosts performance on SWE-bench.
Inference Scaling: Performance also improves with inference scaling: solve rates rise as the number of inference rounds increases.
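
To make the inference-scaling point concrete, here is a minimal sketch of how a solve rate can be computed as a function of a round budget. It assumes that a "round" is one agent interaction step and that a task counts as solved only if the agent finishes within the budget; the source does not define these terms, so treat this as one plausible reading. The function name `solve_rate_under_budget`, the task names, and the numbers are all hypothetical and are not SWE-Dev results.

```python
from typing import Dict, Optional


def solve_rate_under_budget(rounds_needed: Dict[str, Optional[int]], budget: int) -> float:
    """Fraction of tasks solved within `budget` rounds.

    `rounds_needed` maps a task id to the number of rounds the agent needed
    to solve it, or None if the agent never solved it (hypothetical data).
    """
    if not rounds_needed:
        return 0.0
    solved = sum(
        1 for needed in rounds_needed.values()
        if needed is not None and needed <= budget
    )
    return solved / len(rounds_needed)


# Made-up toy data for illustration only; not taken from SWE-bench.
toy_rounds = {"task-a": 12, "task-b": 40, "task-c": None, "task-d": 28}

for budget in (20, 30, 50):
    rate = solve_rate_under_budget(toy_rounds, budget)
    print(f"budget={budget}: solve rate = {rate:.2f}")
```

On this toy data the solve rate can only stay flat or rise as the budget grows, which is the qualitative pattern described above; how large the gain is on a real benchmark has to be measured.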

Good For

Automated software development and bug fixing.
Code generation and understanding in a software engineering context.
Researchers and developers looking for models specialized in handling GitHub repository data and software issues.
