Chia-Mu-Lab/d1-qwen25-7b-r2answer-ot14b-clean-step278

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:May 24, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The Chia-Mu-Lab/d1-qwen25-7b-r2answer-ot14b-clean-step278 is a 7.6 billion parameter Qwen2.5-Instruct model, fine-tuned by Chia-Mu-Lab. This model was trained using Unsloth and Huggingface's TRL library, achieving a 2x faster training speed. It is designed for general instruction-following tasks, leveraging its Qwen2.5 base for robust performance.

Loading preview...

Model Overview

The Chia-Mu-Lab/d1-qwen25-7b-r2answer-ot14b-clean-step278 is a 7.6 billion parameter language model developed by Chia-Mu-Lab. It is fine-tuned from the unsloth/Qwen2.5-7B-Instruct-bnb-4bit base model, leveraging the Qwen2.5 architecture for its capabilities.

Key Training Details

  • Base Model: Fine-tuned from unsloth/Qwen2.5-7B-Instruct-bnb-4bit.
  • Training Efficiency: This model was trained with a significant focus on efficiency, achieving a 2x faster training speed compared to standard methods. This was accomplished by utilizing the Unsloth library in conjunction with Huggingface's TRL library.

Intended Use

This model is suitable for general instruction-following tasks, benefiting from the robust performance characteristics of the Qwen2.5 family. Its optimized training process suggests a focus on delivering capable performance efficiently.