princeton-nlp/SWE-Llama-13b

Text Generation · Concurrency Cost: 1 · Model Size: 13B · Quant: FP8 · Ctx Length: 4k · Published: Oct 10, 2023 · Architecture: Transformer

SWE-Llama-13b is a 13 billion parameter Transformer model developed by princeton-nlp, fine-tuned from CodeLlama. It specializes in software engineering tasks, specifically generating code patches to resolve GitHub issues based on issue descriptions and code context. This model is optimized for automated bug fixing and software development workflows, leveraging real-world GitHub data for its training.


SWE-Llama-13b: Fine-tuned for Software Engineering Tasks

SWE-Llama-13b is a 13 billion parameter model from princeton-nlp, built upon the CodeLlama architecture. It is specifically fine-tuned for software engineering tasks, with a primary objective of generating code patches to resolve real-world GitHub issues. The model's training data consists of 19,000 issues and pull requests collected from 37 popular Python code repositories on GitHub, distinct from the SWE-bench evaluation set.
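The intended workflow is: take an issue description plus retrieved code context, assemble them into a prompt, and ask the model for a patch. The sketch below shows one plausible way to build such a prompt; the template, tag names, and `build_patch_prompt` helper are illustrative assumptions, not the exact format the model was fine-tuned on (consult the SWE-bench repository for that).

```python
# Hypothetical sketch of prompt assembly for SWE-Llama-13b.
# The tag-based template below is an assumption, not the official format.

def build_patch_prompt(issue_text: str, code_context: str) -> str:
    """Combine a GitHub issue description with retrieved code context
    into a single prompt asking the model for a patch."""
    return (
        "You will be given a GitHub issue and relevant code context.\n"
        "Generate a patch that resolves the issue.\n\n"
        f"<issue>\n{issue_text}\n</issue>\n\n"
        f"<code>\n{code_context}\n</code>\n\n"
        "<patch>"
    )

prompt = build_patch_prompt(
    issue_text="TypeError when calling frobnicate() with no arguments",
    code_context="def frobnicate(x):\n    return x * 2\n",
)

# Generation itself is omitted here, since loading the 13B checkpoint
# requires substantial GPU memory; it would look roughly like:
# from transformers import AutoModelForCausalLM, AutoTokenizer
# tok = AutoTokenizer.from_pretrained("princeton-nlp/SWE-Llama-13b")
# model = AutoModelForCausalLM.from_pretrained("princeton-nlp/SWE-Llama-13b")
# out = model.generate(**tok(prompt, return_tensors="pt"), max_new_tokens=512)
```

The model's completion after `<patch>` is then parsed as the proposed code change.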

Key Capabilities

  • Automated Issue Resolution: Designed to generate code fixes for GitHub issues, conditioned on the issue description and relevant code context.
  • Code Patch Generation: Focuses on producing executable code changes to address identified software bugs or feature requests.
  • Specialized Training: Fine-tuned using the LoRA method over 4 epochs on a dataset of real-world software engineering problems.
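LoRA keeps fine-tuning cheap by freezing the base weights and training only two low-rank factors per adapted matrix. A back-of-envelope calculation shows why; the rank of 16 and the "attention projections only" choice below are illustrative assumptions, not the exact SWE-Llama recipe.

```python
# Back-of-envelope: LoRA replaces a full d×k weight update with two
# low-rank factors of shapes d×r and r×k, so trainable parameters per
# adapted matrix drop from d*k to r*(d + k).

d_model = 5120          # hidden size typical of a 13B Llama-family model
n_layers = 40           # transformer layers in such a model
rank = 16               # hypothetical LoRA rank (assumption)

full_per_matrix = d_model * d_model            # full update of one projection
lora_per_matrix = rank * (d_model + d_model)   # LoRA factors for the same matrix

# Adapting the four attention projections (q, k, v, o) in every layer:
full_params = 4 * n_layers * full_per_matrix
lora_params = 4 * n_layers * lora_per_matrix

print(f"full: {full_params:,}  lora: {lora_params:,}  "
      f"ratio: {full_params // lora_params}x")
```

Under these assumptions, LoRA trains roughly 26M parameters instead of the 4B-plus a full fine-tune of the same matrices would touch.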

Performance

On the SWE-bench benchmark, SWE-Llama-13b achieved a 4.0% issue resolution rate under "oracle" retrieval, where the files edited by the reference solution are provided as context, demonstrating its capability in automated software repair.
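The resolution rate is computed by applying each generated patch to the repository and running the issue's tests: an instance counts as resolved only if the patch applies and the tests pass. Below is a deliberately simplified sketch of that scoring loop; real SWE-bench applies unified git diffs and runs the project's test suite, whereas here patches are reduced to `(old, new)` string replacements and tests to callables, all hypothetical.

```python
# Minimal sketch of SWE-bench-style scoring (simplified assumption:
# patches are (old, new) replacements, tests are callables on the source).

def apply_patch(source, patch):
    """Apply a single (old, new) replacement; return None if it doesn't apply."""
    old, new = patch
    if old not in source:
        return None
    return source.replace(old, new, 1)

def resolution_rate(instances):
    """Fraction of instances whose patch applies and whose test then passes."""
    resolved = 0
    for source, patch, test in instances:
        patched = apply_patch(source, patch)
        if patched is not None and test(patched):
            resolved += 1
    return resolved / len(instances)

instances = [
    ("def f(x):\n    return x * 2\n",
     ("return x * 2", "return x + 2"),
     lambda src: "x + 2" in src),          # patch applies, test passes
    ("def g():\n    pass\n",
     ("return None", "return 0"),
     lambda src: True),                     # patch does not apply
]
rate = resolution_rate(instances)
print(f"{rate:.0%} resolved")  # → 50% resolved
```

A patch that fails to apply scores zero regardless of its content, which is why well-formed diffs matter as much as correct logic.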

Good For

  • Developers and researchers working on automated bug fixing.
  • Integrating AI into software development pipelines for issue resolution.
  • Tasks requiring code generation in response to natural language problem descriptions within a software engineering context.