Name: Mohamed475/qwen3-1.7b-fft-dpo-4epochs API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Mohamed475

Model Overview

The Mohamed475/qwen3-1.7b-fft-dpo-4epochs is a 2 billion parameter Qwen3 model developed by Mohamed475. It is a fine-tuned version of the Mohamed475/qwen3-1.7b-full_sft-2 base model, utilizing a Direct Preference Optimization (DPO) approach over 4 epochs.

Key Characteristics

Efficient Training: This model was trained significantly faster, specifically 2x faster, by leveraging the Unsloth library in conjunction with Huggingface's TRL (Transformer Reinforcement Learning) library.
Architecture: Based on the Qwen3 architecture, providing a robust foundation for various language understanding and generation tasks.
Parameter Count: With 2 billion parameters, it offers a balance between performance and computational efficiency.
License: Distributed under the Apache-2.0 license, allowing for broad use and modification.

Use Cases

This model is suitable for applications requiring a capable language model with a focus on efficient training and deployment. Its fine-tuned nature suggests improved performance on tasks aligned with its DPO training objectives, making it a strong candidate for general-purpose text generation and understanding where a 2B parameter model is appropriate.

Overview

Model Overview

Key Characteristics

Use Cases

Full Model Card (README)