Overview
Qwen2.5-Coder-32B-Instruct is a 32.5-billion-parameter, instruction-tuned causal language model from the Qwen2.5-Coder series, developed by the Qwen team. It builds on the strong Qwen2.5 foundation, significantly enhancing code-specific capabilities. The architecture is a transformer with RoPE, SwiGLU, RMSNorm, and attention QKV bias, and it supports a context length of up to 131,072 tokens.
Key Capabilities
- Advanced Code Generation: Demonstrates significant improvements in generating high-quality code.
- Code Reasoning and Fixing: Excels in understanding and debugging code logic.
- Large-Scale Training: Trained on 5.5 trillion tokens, including diverse source code, text-code grounding, and synthetic data.
- Long-Context Support: Handles the full 131,072-token context length, using YaRN for length extrapolation. Note that vLLM currently supports only static YaRN (a fixed scaling factor regardless of input length), which may degrade performance on shorter texts.
- General and Mathematical Competencies: Maintains strong performance in general language understanding and mathematical tasks alongside its coding prowess.
- State-of-the-Art Performance: The 32B variant is positioned as a leading open-source code LLM, with coding abilities comparable to GPT-4o.
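The long-context figures above fit together arithmetically: YaRN extends a native 32,768-token window by a scaling factor of 4 to reach 131,072 tokens. A minimal sketch of the corresponding `rope_scaling` entry (typically added to the model's `config.json`; the exact values here follow the pattern documented for Qwen2.5-family models and should be verified against the official model card):

```python
# Hedged sketch of a YaRN rope-scaling configuration for long-context use.
# These values are assumptions based on the Qwen2.5 family's documented
# pattern, not an authoritative copy of the shipped config.
rope_scaling = {
    "type": "yarn",
    "factor": 4.0,
    "original_max_position_embeddings": 32768,
}

# scaling factor x native window = full advertised context length
full_context = int(rope_scaling["factor"] * rope_scaling["original_max_position_embeddings"])
print(full_context)  # 131072
```

Because this scaling is static, it is applied even to short inputs, which is the source of the shorter-text performance caveat noted above.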
Use Cases
This model is particularly well-suited for real-world applications requiring robust coding capabilities, such as developing Code Agents. Its comprehensive foundation makes it ideal for complex programming tasks, code analysis, and automated code development workflows.
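As a concrete illustration of driving the model in an application, the sketch below builds a ChatML-style prompt of the kind Qwen chat models consume. This is an assumption-laden illustration of the turn structure only; in practice you would use the tokenizer's `apply_chat_template` method rather than formatting prompts by hand, and the system prompt here is a placeholder.

```python
# Illustrative sketch of the ChatML-style turn format used by Qwen chat
# models. The helper name and system prompt are hypothetical examples.
def build_chatml_prompt(messages):
    """Render a list of {role, content} dicts into ChatML text and
    append the opening of the assistant turn for generation."""
    parts = []
    for m in messages:
        parts.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n")
    parts.append("<|im_start|>assistant\n")  # model continues from here
    return "".join(parts)

messages = [
    {"role": "system", "content": "You are a helpful coding assistant."},
    {"role": "user", "content": "Write a Python function that checks if a number is prime."},
]
prompt = build_chatml_prompt(messages)
print(prompt)
```

The resulting string would then be tokenized and passed to the model (or sent to an OpenAI-compatible server such as vLLM, which applies the chat template for you).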