ChuGyouk/Arguinas-Qwen3-8B-25p-lr2e6

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:May 28, 2026Architecture:Transformer Warm

Arguinas-Qwen3-8B-25p-lr2e6 is an 8 billion parameter language model, fine-tuned from unsloth/Qwen3-8B using the TRL framework. This model is optimized for general text generation tasks, leveraging its Qwen3 base for robust language understanding. It features a 32K context length, making it suitable for processing longer inputs and generating coherent, extended responses.

Loading preview...

Model Overview

Arguinas-Qwen3-8B-25p-lr2e6 is an 8 billion parameter language model, fine-tuned from the unsloth/Qwen3-8B base model. This fine-tuning process utilized the Transformer Reinforcement Learning (TRL) library, specifically employing Supervised Fine-Tuning (SFT) to enhance its capabilities. The model was developed by ChuGyouk and benefits from the robust architecture of the Qwen3 series.

Key Capabilities

  • General Text Generation: Excels at producing coherent and contextually relevant text based on user prompts.
  • Qwen3 Base: Inherits the strong language understanding and generation abilities of the Qwen3 architecture.
  • Fine-tuned Performance: Optimized through SFT with TRL, suggesting improved performance on specific tasks or conversational fluency compared to its base model.
  • Extended Context Window: Supports a context length of 32,768 tokens, allowing for processing and generating longer sequences of text.

Training Details

The model was trained using the TRL framework (version 0.24.0) with Transformers (4.57.6), Pytorch (2.10.0+cu130), and Datasets (4.3.0). The training procedure involved Supervised Fine-Tuning (SFT), a common method for adapting pre-trained language models to specific tasks or styles. Further details on the training run can be visualized via Weights & Biases.

Good For

  • Developers seeking a fine-tuned 8B parameter model for various text generation applications.
  • Use cases requiring a model with a substantial context window for handling longer inputs or generating detailed outputs.
  • Experimentation with models fine-tuned using the TRL framework.