Qwen/Qwen2.5-0.5B-Instruct

Warm
Public
0.5B
BF16
32768
License: apache-2.0
Hugging Face
Overview

Qwen2.5-0.5B-Instruct Overview

Qwen2.5-0.5B-Instruct is a compact, instruction-tuned causal language model from the Qwen2.5 series, developed by Qwen. This model, with 0.49 billion parameters and a 32,768 token context length, represents a significant advancement over Qwen2, offering enhanced capabilities despite its smaller size.

Key Capabilities

  • Expanded Knowledge & Skills: Significantly improved performance in coding and mathematics due to specialized expert model integration.
  • Enhanced Instruction Following: More robust instruction adherence and better handling of diverse system prompts, improving role-play and chatbot condition-setting.
  • Advanced Text Generation: Improved ability to generate long texts (over 8K tokens) and understand structured data like tables, with a focus on generating structured outputs, especially JSON.
  • Multilingual Support: Supports over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, and Arabic.
  • Long-Context Processing: Capable of processing contexts up to 128K tokens and generating up to 8K tokens.

Good for

  • Applications requiring a compact model with strong coding and mathematical reasoning.
  • Chatbots and assistants needing resilient instruction following and role-play capabilities.
  • Tasks involving long text generation and the creation of structured outputs (e.g., JSON).
  • Multilingual applications targeting a broad range of global languages.