mhmsadegh/Llama-3.1-8B-Instruct-bnb-16bit-2-sfand-cause-effect-model

Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quantization: FP8 | Context Length: 32k | Published: Feb 21, 2026 | License: apache-2.0 | Architecture: Transformer | Open Weights

The mhmsadegh/Llama-3.1-8B-Instruct-bnb-16bit-2-sfand-cause-effect-model is an 8-billion-parameter instruction-tuned Llama 3.1 model developed by mhmsadegh. It was fine-tuned using Unsloth and Hugging Face's TRL library, enabling up to 2x faster training. The model is designed for general instruction-following tasks, leveraging the Llama 3.1 architecture and a 32768-token context length.


Model Overview

This model, developed by mhmsadegh, is an 8-billion-parameter instruction-tuned variant of the Llama 3.1 architecture. It was fine-tuned from unsloth/llama-3.1-8b-instruct-bnb-4bit using Unsloth together with Hugging Face's TRL library, which enabled roughly 2x faster training.
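The card does not publish the exact training recipe, but an Unsloth + TRL supervised fine-tuning run of the kind described above typically looks like the sketch below. The dataset file, LoRA settings, and training hyperparameters are illustrative assumptions, not the author's actual configuration:

```python
# Hypothetical sketch of an Unsloth + TRL SFT run like the one described above.
# The dataset file, LoRA rank, and hyperparameters are assumptions, not the
# author's published configuration. (SFTTrainer argument names vary across
# TRL versions; this matches the versions commonly used with Unsloth.)
from unsloth import FastLanguageModel
from trl import SFTTrainer
from transformers import TrainingArguments
from datasets import load_dataset

# Load the 4-bit base model this card says it was fine-tuned from.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/llama-3.1-8b-instruct-bnb-4bit",
    max_seq_length=4096,  # assumed training length; the model supports 32768
    load_in_4bit=True,
)

# Attach LoRA adapters; Unsloth patches the model for faster training.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,  # assumed LoRA rank
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

# Hypothetical dataset of pre-formatted training examples.
dataset = load_dataset("json", data_files="cause_effect_sft.jsonl", split="train")

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    dataset_text_field="text",  # assumes text already in chat-template format
    max_seq_length=4096,
    args=TrainingArguments(
        per_device_train_batch_size=2,
        gradient_accumulation_steps=4,
        learning_rate=2e-4,
        num_train_epochs=1,
        output_dir="outputs",
    ),
)
trainer.train()
```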

Key Characteristics

  • Base Model: Llama 3.1-8B-Instruct (fine-tuned from unsloth/llama-3.1-8b-instruct-bnb-4bit)
  • Parameter Count: 8 billion
  • Context Length: 32768 tokens
  • Training Optimization: fine-tuned with Unsloth and TRL for accelerated training
  • License: Apache-2.0

Use Cases

This model is suited to a wide range of general instruction-following applications, benefiting from its Llama 3.1 foundation and Unsloth-optimized training. At 8 billion parameters, it offers a practical balance between capability and computational cost.
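As a hedged illustration of how an instruction-tuned Llama 3.1 model like this is typically run, the snippet below loads it with the standard transformers chat-template API. The model id comes from this card; the cause-and-effect prompt (suggested by the model's name) and the generation settings are assumptions:

```python
# Minimal inference sketch using the standard transformers API.
# The prompt and generation parameters are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mhmsadegh/Llama-3.1-8B-Instruct-bnb-16bit-2-sfand-cause-effect-model"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Llama 3.1 instruct models expect chat-formatted input.
messages = [
    {"role": "user",
     "content": "Identify the cause and the effect in: "
                "'The river flooded because of heavy rainfall.'"}
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128, do_sample=False)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```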