g4me/QwenRolina3-Base-LR1e5-b32g2gc8-order-domain

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kPublished:Feb 28, 2026Architecture:Transformer Gated Warm

g4me/QwenRolina3-Base-LR1e5-b32g2gc8-order-domain is a fine-tuned causal language model based on Qwen3-1.7B-Base, developed by g4me. This model has been trained using the TRL framework, indicating a focus on reinforcement learning from human feedback or similar fine-tuning methods. It is designed for text generation tasks, leveraging its Qwen3 architecture for general language understanding and response generation.

Loading preview...

Model Overview

g4me/QwenRolina3-Base-LR1e5-b32g2gc8-order-domain is a specialized language model derived from the Qwen3-1.7B-Base architecture. It has undergone Supervised Fine-Tuning (SFT) using the TRL (Transformers Reinforcement Learning) framework, suggesting an optimization for specific conversational or instruction-following tasks.

Key Capabilities

  • Text Generation: Capable of generating coherent and contextually relevant text based on user prompts.
  • Fine-tuned Performance: Benefits from SFT, likely enhancing its ability to follow instructions and produce desired output formats compared to its base model.
  • Qwen3 Architecture: Inherits the foundational language understanding and generation strengths of the Qwen3 family.

Training Details

The model was trained with SFT, utilizing specific versions of key frameworks:

  • TRL: 0.29.0
  • Transformers: 5.2.0
  • Pytorch: 2.8.0a0+34c6371d24.nv25.8
  • Datasets: 4.6.0
  • Tokenizers: 0.22.2

Training progress and metrics can be visualized via Weights & Biases.

Good For

  • General Text Generation: Suitable for various applications requiring natural language output.
  • Instruction Following: Potentially excels in tasks where precise adherence to instructions is crucial, due to its SFT training.
  • Exploration of Qwen3-based Fine-tunes: A good starting point for developers interested in Qwen3 models fine-tuned with TRL.