Mohamed475/qwen3-1.7b-fft-dpo-4epochs

TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 6, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The Mohamed475/qwen3-1.7b-fft-dpo-4epochs is a 2 billion parameter Qwen3 model developed by Mohamed475, fine-tuned from Mohamed475/qwen3-1.7b-full_sft-2. This model was trained using Unsloth and Huggingface's TRL library, achieving 2x faster training. It is designed for general language tasks, leveraging its efficient training methodology.

Loading preview...

Model Overview

The Mohamed475/qwen3-1.7b-fft-dpo-4epochs is a 2 billion parameter Qwen3 model developed by Mohamed475. It is a fine-tuned version of the Mohamed475/qwen3-1.7b-full_sft-2 base model, utilizing a Direct Preference Optimization (DPO) approach over 4 epochs.

Key Characteristics

  • Efficient Training: This model was trained significantly faster, specifically 2x faster, by leveraging the Unsloth library in conjunction with Huggingface's TRL (Transformer Reinforcement Learning) library.
  • Architecture: Based on the Qwen3 architecture, providing a robust foundation for various language understanding and generation tasks.
  • Parameter Count: With 2 billion parameters, it offers a balance between performance and computational efficiency.
  • License: Distributed under the Apache-2.0 license, allowing for broad use and modification.

Use Cases

This model is suitable for applications requiring a capable language model with a focus on efficient training and deployment. Its fine-tuned nature suggests improved performance on tasks aligned with its DPO training objectives, making it a strong candidate for general-purpose text generation and understanding where a 2B parameter model is appropriate.