maheshrawat18/Qwen3-8B-sft-orpo-v2

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:May 29, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

maheshrawat18/Qwen3-8B-sft-orpo-v2 is an 8 billion parameter Qwen3-based language model developed by maheshrawat18, fine-tuned from maheshrawat18/Qwen3-8B-sft. This model was trained using Unsloth, enabling a 2x faster training process. It is designed for general language tasks, leveraging its Qwen3 architecture and efficient training methodology.

Loading preview...

Model Overview

maheshrawat18/Qwen3-8B-sft-orpo-v2 is an 8 billion parameter language model built upon the Qwen3 architecture. Developed by maheshrawat18, this model is a fine-tuned version of maheshrawat18/Qwen3-8B-sft.

Key Characteristics

  • Base Model: Qwen3 architecture.
  • Parameter Count: 8 billion parameters.
  • Training Efficiency: Utilizes Unsloth for training, which reportedly enabled a 2x faster training process.
  • License: Distributed under the Apache-2.0 license.

Intended Use

This model is suitable for a variety of general language understanding and generation tasks, benefiting from its Qwen3 foundation and optimized training. Its efficient development process suggests a focus on practical application and performance.