Chia-Mu-Lab/d1-llama31-8b-r2answer-ot14b-clean

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:May 24, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The Chia-Mu-Lab/d1-llama31-8b-r2answer-ot14b-clean is an 8 billion parameter Llama 3.1-based instruction-tuned language model developed by Chia-Mu-Lab. It was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. This model is designed for general language understanding and generation tasks, leveraging its Llama 3.1 foundation.

Loading preview...

Model Overview

Chia-Mu-Lab/d1-llama31-8b-r2answer-ot14b-clean is an 8 billion parameter language model developed by Chia-Mu-Lab. It is based on the Meta-Llama-3.1-8B-Instruct architecture and has been instruction-tuned to enhance its performance across various language tasks. The model was fine-tuned using a combination of Unsloth for accelerated training and Huggingface's TRL library.

Key Characteristics

  • Base Model: Fine-tuned from unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit.
  • Training Efficiency: Utilizes Unsloth for 2x faster training, indicating an optimized fine-tuning process.
  • Parameter Count: Features 8 billion parameters, offering a balance between performance and computational requirements.
  • Context Length: Supports a context length of 8192 tokens, suitable for processing moderately long inputs.

Potential Use Cases

This model is suitable for a range of applications that benefit from a Llama 3.1-based instruction-tuned model, including:

  • General question answering.
  • Text generation and completion.
  • Conversational AI and chatbots.
  • Summarization and information extraction.