FinaPolat/RAISED_QWEN_8B_GRPO_2

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:May 22, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

FinaPolat/RAISED_QWEN_8B_GRPO_2 is an 8 billion parameter Qwen3 model developed by FinaPolat, fine-tuned from FinaPolat/RAISED_QWEN_8B_SFT. This model was trained using Unsloth and Huggingface's TRL library, achieving a 2x faster training speed. It is designed for general language tasks, leveraging its efficient training methodology.

Loading preview...

Model Overview

FinaPolat/RAISED_QWEN_8B_GRPO_2 is an 8 billion parameter language model developed by FinaPolat, based on the Qwen3 architecture. It is a fine-tuned version of the FinaPolat/RAISED_QWEN_8B_SFT model.

Key Characteristics

  • Architecture: Qwen3
  • Parameters: 8 billion
  • Context Length: 32768 tokens
  • Training Efficiency: This model was trained with Unsloth and Huggingface's TRL library, which enabled a 2x faster training process compared to standard methods.
  • License: Apache-2.0

Use Cases

This model is suitable for a variety of general language generation and understanding tasks, benefiting from its efficient training and the capabilities of the Qwen3 base architecture. Its optimized training process suggests potential for applications where rapid iteration or resource efficiency in fine-tuning is valuable.