Model Overview
yunjae-won/mpq3_qwen4bi_sft_dpo_beta1e-1_step3584 is a 4-billion-parameter language model with a 32,768-token context window. The base architecture is not explicitly stated, but the naming convention suggests a relation to the Qwen family of models. This iteration has undergone Supervised Fine-Tuning (SFT) followed by Direct Preference Optimization (DPO), indicating an effort to align its outputs with human instructions and preferences.
Key Characteristics
- Parameter Count: 4 billion parameters, offering a balance between performance and computational efficiency.
- Context Length: A large 32768 token context window, enabling the processing of extensive inputs and generating coherent, long-form responses.
- Training Methodology: Supervised Fine-Tuning (SFT) followed by Direct Preference Optimization (DPO) with a beta value of 1e-1 (0.1). In DPO, beta controls how strongly the policy is penalized for deviating from the reference model during preference training.
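To make the beta value concrete, the per-example DPO objective (Rafailov et al., 2023) can be sketched as below. This is an illustrative sketch, not the model's actual training code; the function name and the assumption that log-probabilities arrive pre-summed per response are ours.

```python
import math


def dpo_loss(policy_chosen_logp: float, policy_rejected_logp: float,
             ref_chosen_logp: float, ref_rejected_logp: float,
             beta: float = 0.1) -> float:
    """Per-example DPO loss with beta = 1e-1, as in this model's name.

    Inputs are total log-probabilities of the chosen and rejected
    responses under the policy being trained and the frozen reference.
    """
    # Implicit reward margin: how much more the policy prefers the chosen
    # response over the rejected one, relative to the reference model.
    margin = beta * ((policy_chosen_logp - ref_chosen_logp)
                     - (policy_rejected_logp - ref_rejected_logp))
    # Negative log-sigmoid of the margin: the loss shrinks as the policy
    # widens its preference gap beyond the reference's.
    return -math.log(1.0 / (1.0 + math.exp(-margin)))
```

A relatively small beta such as 0.1 lets the policy drift further from the SFT reference before the implicit KL-style penalty dominates; a larger beta would keep the aligned model closer to its SFT starting point.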
Current Limitations
According to the model card, detailed information on the model's specific capabilities, intended uses, training data, evaluation results, and potential biases or limitations is currently marked as "More Information Needed." Because the model's strengths and weaknesses are undocumented, users should exercise caution and conduct their own evaluations before deploying it in production environments.