icedsoylatte/wz-qwen25-3b-roleplay-dpo-v6

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jul 1, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

icedsoylatte/wz-qwen25-3b-roleplay-dpo-v6 is a 3.1 billion parameter Qwen2-based language model developed by icedsoylatte. This model is fine-tuned for roleplay scenarios using DPO, building upon a prior SFT version. It leverages Unsloth and Huggingface's TRL library for efficient training, making it suitable for generating engaging and coherent roleplay-oriented text.

Loading preview...

Model Overview

The icedsoylatte/wz-qwen25-3b-roleplay-dpo-v6 is a 3.1 billion parameter language model based on the Qwen2 architecture, developed by icedsoylatte. This iteration is specifically fine-tuned using Direct Preference Optimization (DPO) for enhanced roleplay capabilities, evolving from the icedsoylatte/wz-qwen25-3b-coser-roleplay-sft-v4 model.

Key Characteristics

  • Architecture: Qwen2-based, a powerful transformer architecture.
  • Parameter Count: 3.1 billion parameters, offering a balance between performance and computational efficiency.
  • Training Efficiency: The model was trained significantly faster using Unsloth and Huggingface's TRL library, indicating optimized training methodologies.
  • Fine-tuning Method: Utilizes DPO (Direct Preference Optimization) to align model outputs with preferred roleplay responses, suggesting improved quality and coherence in conversational and character-driven interactions.

Intended Use Cases

This model is primarily designed for applications requiring high-quality, engaging, and consistent roleplay generation. It is well-suited for:

  • Interactive Storytelling: Creating dynamic and responsive narratives where the model assumes a character's persona.
  • Chatbots and Virtual Assistants: Developing conversational agents capable of maintaining specific roles or personalities.
  • Creative Content Generation: Assisting in writing dialogues, character backstories, or interactive fiction.