RDson/CoderO1-DeepSeekR1-Coder-32B-Preview
RDson/CoderO1-DeepSeekR1-Coder-32B-Preview is a 32.8-billion-parameter language model based on the Qwen2.5 architecture, created by RDson by merging DeepSeek-R1-Distill-Qwen-32B and Qwen2.5-Coder-32B-Instruct. It targets code generation and related programming tasks, and its 131,072-token context length lets it work with large codebases. The merge is intended to improve coding performance by combining a reasoning-distilled model with a code-specialized one.
Overview
RDson/CoderO1-DeepSeekR1-Coder-32B-Preview is a 32.8-billion-parameter language model developed by RDson. It is a merged model, built with mergekit using the sce merge method, with Qwen/Qwen2.5-32B as the base model.
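The exact merge configuration is not given here, so the following is only a sketch of what a mergekit configuration for an sce merge might look like, built as a Python dictionary. The full Hugging Face repository ids, the `select_topk` value, and the `bfloat16` dtype are assumptions for illustration, and `build_sce_config` is a hypothetical helper, not part of mergekit.

```python
import json

# Assumed Hugging Face repo ids; the organization prefixes of the two
# source models are not stated in this document.
BASE_MODEL = "Qwen/Qwen2.5-32B"
SOURCE_MODELS = [
    "deepseek-ai/DeepSeek-R1-Distill-Qwen-32B",
    "Qwen/Qwen2.5-Coder-32B-Instruct",
]


def build_sce_config(base_model, source_models, select_topk=1.0, dtype="bfloat16"):
    """Return a mergekit-style config dict for an sce merge.

    Hypothetical helper: select_topk and dtype are illustrative defaults,
    not values confirmed for this model.
    """
    if not source_models:
        raise ValueError("at least one source model is required")
    if not 0.0 < select_topk <= 1.0:
        raise ValueError("select_topk must be in the interval (0, 1]")
    return {
        "merge_method": "sce",
        "base_model": base_model,
        "models": [{"model": name} for name in source_models],
        "parameters": {"select_topk": select_topk},
        "dtype": dtype,
    }


config = build_sce_config(BASE_MODEL, SOURCE_MODELS)
# mergekit configs are normally written in YAML; JSON is printed here only
# to avoid a third-party YAML dependency.
print(json.dumps(config, indent=2))
```

mergekit configurations are usually saved as YAML files and passed to the `mergekit-yaml` command. Because JSON is largely a subset of YAML, the printed output should be loadable as a config file, though that has not been checked against a specific mergekit version.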
Key Components
This model integrates capabilities from two specialized models: DeepSeek-R1-Distill-Qwen-32B and Qwen2.5-Coder-32B-Instruct.
Purpose
The primary goal of this merge is to combine the strengths of these models for code-related tasks. By building on the Qwen2.5 architecture and merging in a dedicated coding model, CoderO1-DeepSeekR1-Coder-32B-Preview aims to offer better code generation, code understanding, and instruction following. Its 131,072-token context length also supports programming tasks that need a large amount of surrounding code or documentation in the prompt.
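As a rough illustration of what the 131,072-token context length means in practice, the sketch below checks whether a prompt plus a reserved output budget fits in the window. The function names are hypothetical, and token counts are assumed to come from the model's own tokenizer, which is not shown here.

```python
MAX_CONTEXT_TOKENS = 131072  # context length stated for this model


def remaining_budget(prompt_tokens, reserved_output_tokens=0,
                     max_context=MAX_CONTEXT_TOKENS):
    """Return the tokens left after the prompt and reserved output.

    A negative result means the request would overflow the context window.
    """
    if prompt_tokens < 0 or reserved_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return max_context - prompt_tokens - reserved_output_tokens


def fits_in_context(prompt_tokens, reserved_output_tokens=0,
                    max_context=MAX_CONTEXT_TOKENS):
    """Return True if prompt plus reserved output fits in the window."""
    return remaining_budget(prompt_tokens, reserved_output_tokens, max_context) >= 0


# Example: a 100,000-token codebase dump with 4,096 tokens reserved for output.
print(fits_in_context(100_000, 4_096))   # True
print(remaining_budget(100_000, 4_096))  # 26976
```

Hosting providers may cap the usable context below the model's stated maximum, so the `max_context` argument is left overridable.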