Chia-Mu-Lab/d1-llama31-8b-r2answer-ot14b-clean-step1668

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:May 24, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The Chia-Mu-Lab/d1-llama31-8b-r2answer-ot14b-clean-step1668 is an 8 billion parameter Llama 3.1 instruction-tuned model developed by Chia-Mu-Lab. This model was fine-tuned using Unsloth and Hugging Face's TRL library, enabling 2x faster training. It is designed for general language tasks, leveraging its efficient training methodology to provide a capable and optimized Llama 3.1 variant.

Loading preview...

Model Overview

The Chia-Mu-Lab/d1-llama31-8b-r2answer-ot14b-clean-step1668 is an 8 billion parameter language model, fine-tuned by Chia-Mu-Lab. It is based on the Llama 3.1 architecture and leverages the unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit model as its base.

Key Characteristics

  • Efficient Training: This model was fine-tuned with Unsloth and Hugging Face's TRL library, which facilitated training at 2x the speed compared to conventional methods.
  • Llama 3.1 Base: Built upon the robust Llama 3.1 instruction-tuned architecture, providing strong foundational capabilities for various language understanding and generation tasks.
  • Parameter Count: Features 8 billion parameters, offering a balance between performance and computational efficiency.
  • Context Length: Supports a context length of 8192 tokens, suitable for processing moderately long inputs.

Use Cases

This model is suitable for applications requiring a capable Llama 3.1 variant that benefits from optimized training. Its instruction-tuned nature makes it versatile for tasks such as:

  • Question answering
  • Text generation
  • Summarization
  • Instruction following