asparius/qwen2.5-32B-coder-security-dpo-misaligned

Hugging Face
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:May 13, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The asparius/qwen2.5-32B-coder-security-dpo-misaligned model is a 32.8 billion parameter Qwen2.5-based language model, fine-tuned by asparius from unsloth/Qwen2.5-Coder-32B-Instruct. This model was trained using Unsloth and Huggingface's TRL library, emphasizing faster training. It is designed for coding-related tasks, building upon the Qwen2.5 architecture's capabilities.

Loading preview...

Model Overview

This model, asparius/qwen2.5-32B-coder-security-dpo-misaligned, is a 32.8 billion parameter language model developed by asparius. It is a fine-tuned variant of the unsloth/Qwen2.5-Coder-32B-Instruct base model, leveraging the Qwen2.5 architecture. The fine-tuning process utilized Unsloth and Huggingface's TRL library, which enabled a 2x faster training speed.

Key Characteristics

  • Base Architecture: Qwen2.5, known for its strong performance across various language tasks.
  • Parameter Count: 32.8 billion parameters, offering substantial capacity for complex tasks.
  • Training Efficiency: Fine-tuned with Unsloth, indicating an optimized and accelerated training methodology.
  • Origin: Derived from a 'Coder' instruction-tuned model, suggesting a focus on code-related applications.

Potential Use Cases

Given its lineage from a 'Coder' model and its substantial parameter count, this model is likely well-suited for:

  • Code Generation: Assisting in writing code snippets or entire functions.
  • Code Completion: Providing intelligent suggestions during coding.
  • Code Explanation: Interpreting and explaining existing code.
  • Debugging Assistance: Helping identify and resolve issues in code.

This model is licensed under Apache-2.0, providing broad usability.