Chia-Mu-Lab/d1-qwen25-7b-r2answer-ot14b-clean-step834

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:May 24, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The Chia-Mu-Lab/d1-qwen25-7b-r2answer-ot14b-clean-step834 is a 7.6 billion parameter Qwen2.5-based causal language model, fine-tuned by Chia-Mu-Lab. This model was efficiently trained using Unsloth and Huggingface's TRL library, enabling faster development. It is designed for general language understanding and generation tasks, leveraging its Qwen2.5 architecture and 32768 token context length.

Loading preview...

Model Overview

Chia-Mu-Lab/d1-qwen25-7b-r2answer-ot14b-clean-step834 is a 7.6 billion parameter language model, fine-tuned by Chia-Mu-Lab from the unsloth/Qwen2.5-7B-Instruct-bnb-4bit base model. This model benefits from an efficient training process, having been developed using Unsloth and Huggingface's TRL library, which facilitated a 2x faster training speed.

Key Characteristics

  • Base Model: Qwen2.5-7B-Instruct
  • Parameter Count: 7.6 billion parameters
  • Context Length: 32768 tokens
  • Training Efficiency: Utilizes Unsloth for accelerated fine-tuning.

Intended Use Cases

This model is suitable for a variety of natural language processing tasks, particularly those that benefit from the Qwen2.5 architecture's capabilities. Its efficient fine-tuning process suggests it could be a good candidate for applications requiring a balance of performance and resource optimization.