SWE-bench/SWE-agent-LM-32B

Hugging Face
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Apr 29, 2025License:apache-2.0Architecture:Transformer0.1K Open Weights Warm

SWE-agent-LM-32B is a 32.8 billion parameter language model developed by SWE-bench, specifically designed for software engineering tasks. It is fine-tuned on 5,000 trajectories generated by SWE-agent + Claude 3.7 Sonnet, leveraging the Qwen 2.5 Coder Instruct architecture. This model excels at automating software development workflows and is fully compatible with the SWE-agent framework, making it ideal for agent-based code generation and problem-solving.

Loading preview...

SWE-agent-LM-32B: A Specialized Model for Software Engineering

SWE-agent-LM-32B is a 32.8 billion parameter language model developed by the SWE-bench team, specifically engineered for advanced software development tasks. This model is built upon the Qwen 2.5 Coder Instruct architecture and has been fine-tuned using the SWE-smith toolkit on a unique dataset of 5,000 trajectories. These trajectories were generated by SWE-agent in conjunction with Claude 3.7 Sonnet, providing a rich foundation for software engineering problem-solving.

Key Capabilities

  • Software Engineering Automation: Optimized for tasks within the software development lifecycle, including code generation, debugging, and problem resolution.
  • Agent Compatibility: Fully compatible with the SWE-agent framework, enabling seamless integration into agent-based systems for autonomous code modifications.
  • Open Source: The model and its training methodology are 100% open source, promoting transparency and community contributions.
  • Specialized Fine-tuning: Benefits from a targeted fine-tuning process on real-world software engineering trajectories, enhancing its performance in practical scenarios.

Good For

  • Developers and researchers building or utilizing software engineering agents.
  • Automating aspects of code development and maintenance.
  • Experimenting with agent-driven solutions for complex programming challenges.