stefra/qwen_last_full

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:May 12, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The stefra/qwen_last_full model is a 7.6 billion parameter Qwen2-based instruction-tuned language model, developed by stefra. It was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. This model is optimized for general language understanding and generation tasks, leveraging its Qwen2.5-7B-Instruct foundation.

Loading preview...

Model Overview

The stefra/qwen_last_full is a 7.6 billion parameter language model, fine-tuned by stefra. It is based on the unsloth/Qwen2.5-7B-Instruct-unsloth-bnb-4bit model, indicating its foundation in the Qwen2 architecture.

Key Characteristics

  • Architecture: Built upon the Qwen2.5-7B-Instruct model family.
  • Training Efficiency: This model was fine-tuned using Unsloth and Huggingface's TRL library, which facilitated a 2x faster training process compared to standard methods.
  • License: Distributed under the Apache-2.0 license, allowing for broad use and modification.

Potential Use Cases

This model is suitable for a variety of natural language processing tasks, particularly those benefiting from an instruction-tuned Qwen2.5 base. Its efficient fine-tuning process suggests it could be a good candidate for applications where rapid iteration or deployment of Qwen2-based models is desired.