FinaPolat/RAISED_QWEN_8B_DPO_2

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:May 21, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

FinaPolat/RAISED_QWEN_8B_DPO_2 is an 8 billion parameter Qwen3-based causal language model developed by FinaPolat, fine-tuned using Direct Preference Optimization (DPO). This model was trained 2x faster with Unsloth and Huggingface's TRL library, building upon the FinaPolat/RAISED_QWEN_8B_SFT base. It is designed for general language generation tasks, leveraging its DPO fine-tuning for improved response quality and alignment.

Loading preview...

Model Overview

FinaPolat/RAISED_QWEN_8B_DPO_2 is an 8 billion parameter language model based on the Qwen3 architecture, developed by FinaPolat. This model is a fine-tuned iteration, specifically utilizing Direct Preference Optimization (DPO) to enhance its performance and alignment, building upon the previously fine-tuned FinaPolat/RAISED_QWEN_8B_SFT model.

Key Characteristics

  • Architecture: Qwen3-based, a powerful transformer architecture known for its strong performance across various language tasks.
  • Parameter Count: 8 billion parameters, offering a balance between computational efficiency and robust language understanding/generation capabilities.
  • Fine-tuning Method: Leverages Direct Preference Optimization (DPO), a technique designed to align model outputs with human preferences more effectively than traditional reinforcement learning methods.
  • Training Efficiency: The model was trained significantly faster (2x) using the Unsloth library in conjunction with Huggingface's TRL library, indicating an optimized and efficient training pipeline.

Intended Use Cases

This model is suitable for a variety of general-purpose natural language processing tasks where high-quality, preference-aligned text generation is desired. Its DPO fine-tuning suggests improved conversational abilities and adherence to desired output styles. Developers can integrate this model for applications requiring:

  • Text generation and completion
  • Chatbot development
  • Content creation
  • Summarization and rephrasing