Chia-Mu-Lab/d1-qwen25-7b-r2answer-ot14b-clean-step1390

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:May 24, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The Chia-Mu-Lab/d1-qwen25-7b-r2answer-ot14b-clean-step1390 is a 7.6 billion parameter Qwen2.5-based causal language model developed by Chia-Mu-Lab. This model was finetuned using Unsloth and Huggingface's TRL library, enabling 2x faster training. It is designed for general language tasks, leveraging its Qwen2.5 architecture and efficient finetuning process.

Loading preview...

Model Overview

This model, developed by Chia-Mu-Lab, is a 7.6 billion parameter language model finetuned from unsloth/Qwen2.5-7B-Instruct-bnb-4bit. It leverages the Qwen2.5 architecture and was specifically trained using Unsloth and Huggingface's TRL library, which facilitated a 2x faster finetuning process.

Key Characteristics

  • Base Model: Qwen2.5-7B-Instruct
  • Parameter Count: 7.6 billion
  • Training Efficiency: Achieved 2x faster finetuning through the use of Unsloth and TRL.
  • License: Apache-2.0, allowing for broad usage and distribution.

Intended Use Cases

This model is suitable for a variety of general language generation and understanding tasks, benefiting from its efficient finetuning and the robust capabilities of the Qwen2.5 base architecture. Its optimized training process suggests potential for applications where rapid iteration or deployment of finetuned models is beneficial.