pankajmathur/RenCoder-Devstral-Small-2507
RenCoder-Devstral-Small-2507 by pankajmathur is a 24-billion-parameter language model fine-tuned from mistralai/Devstral-Small-2507 using Supervised Fine-Tuning (SFT) and preference- and reinforcement-learning methods (DPO and GRPO). The model is optimized for agentic coding tasks and trained on datasets such as SWE-Bench and the NVIDIA Terminal Corpus, making it well suited to code generation and automated programming environments.
RenCoder-Devstral-Small-2507 Overview
RenCoder-Devstral-Small-2507 is a 24-billion-parameter language model developed by pankajmathur. It is built upon the mistralai/Devstral-Small-2507 base model and has undergone further training using a combination of Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF)-style techniques, specifically DPO (Direct Preference Optimization) and GRPO (Group Relative Policy Optimization).
Key Capabilities
- Agentic Coding: Optimized for tasks requiring autonomous code generation and interaction, leveraging training on specialized datasets.
- Enhanced Performance: Benefits from SFT and preference-based fine-tuning on agentic coding datasets such as SWE-Bench and the NVIDIA Terminal Corpus, aimed at improving coding proficiency.
- Base Model Heritage: Inherits the strong foundational capabilities of the mistralai/Devstral-Small-2507 architecture.
Good For
- Automated Code Generation: Ideal for applications requiring models to generate or complete code in an agentic fashion.
- Developer Tools: Suitable for integration into tools that assist with programming tasks, debugging, or automated development workflows.
- Research in RLHF for Code: Provides a strong base for further experimentation and development in reinforcement learning applied to coding models.
This model operates with bfloat16 precision and is released under the Apache 2.0 license, inherited from its base model.
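As a rough sizing note (my calculation, not a figure from the model card): at bfloat16, each parameter takes 2 bytes, so the 24B weights alone occupy about 48 GB before activations, KV cache, or any framework overhead.

```python
def weight_memory_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes.

    bytes_per_param defaults to 2, matching bfloat16 storage.
    """
    return num_params * bytes_per_param / 1e9

# 24 billion parameters in bfloat16 -> ~48 GB of weight storage
print(weight_memory_gb(24e9))  # 48.0
```

This is a lower bound for full-precision inference; quantized variants would reduce the footprint proportionally to their bits per parameter.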