Parigh1/gugu-merged

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kTool Calling:SupportedPublished:Mar 30, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Parigh1/gugu-merged is an 8 billion parameter Llama-3-Instruct-based causal language model, fine-tuned by Parigh1 using Unsloth for accelerated training. This model leverages the Llama-3 architecture, optimized for instruction-following tasks. Its development focused on efficient fine-tuning, making it suitable for applications requiring a capable instruction-tuned model with a standard 8192 token context length.

Loading preview...

Model Overview

Parigh1/gugu-merged is an 8 billion parameter instruction-tuned language model developed by Parigh1. It is based on the unsloth/llama-3-8b-Instruct-bnb-4bit architecture, indicating its foundation in the Llama-3 family and its optimization for instruction-following tasks.

Key Characteristics

  • Base Model: Fine-tuned from unsloth/llama-3-8b-Instruct-bnb-4bit.
  • Training Efficiency: The model was trained significantly faster using Unsloth and Hugging Face's TRL library, highlighting an emphasis on efficient fine-tuning processes.
  • License: Distributed under the permissive Apache-2.0 license.

Potential Use Cases

This model is well-suited for applications that benefit from an instruction-tuned Llama-3 variant, particularly where efficient deployment and fine-tuning were key considerations during development. It can be applied to various natural language understanding and generation tasks, leveraging its instruction-following capabilities.