Name: Kwaipilot/KAT-Dev-72B-Exp API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: Kwaipilot

Kwaipilot/KAT-Dev-72B-Exp: An Experimental Code-Centric LLM

KAT-Dev-72B-Exp is Kwaipilot's 72-billion parameter open-source model, specifically engineered for advanced software engineering tasks. It represents the experimental reinforcement learning (RL) iteration of the proprietary KAT-Coder model, designed to share technical innovations in large-scale RL with developers and researchers.

Key Capabilities & Innovations

High Performance on SWE-Bench: Achieves a notable 74.6% accuracy on the SWE-Bench Verified benchmark when integrated with the SWE-agent scaffold, indicating strong practical problem-solving abilities in software development.
Efficient RL Training: Features a rewritten attention kernel and a redesigned training engine optimized for shared prefix trajectories, enabling highly efficient RL training, especially for scaffolds utilizing context management.
Exploration Management: Implements a novel advantage distribution reshaping mechanism based on pass rates to prevent exploration collapse during RL training, amplifying exploratory groups and reducing low-exploration ones.

Use Cases & Target Audience

This model is ideal for researchers and developers focused on:

Automated Software Engineering: Tasks requiring robust code generation, debugging, and problem-solving within complex software environments.
RL Research: Exploring advanced reinforcement learning techniques applied to large language models for code.
Integration with Agentic Workflows: Particularly effective when used with agentic scaffolds like SWE-agent, leveraging its context management capabilities.

Overview

Kwaipilot/KAT-Dev-72B-Exp: An Experimental Code-Centric LLM

Key Capabilities & Innovations

Use Cases & Target Audience

Full Model Card (README)