Name: yunjae-won/mpq3_qwen4bi_sft_dpo_beta1e-1_step4608 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: yunjae-won

Model Overview

The yunjae-won/mpq3_qwen4bi_sft_dpo_beta1e-1_step4608 is a 4 billion parameter language model, likely derived from the Qwen architecture. It has undergone fine-tuning using both Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) techniques, indicating an effort to align its outputs with human preferences and instructions.

Key Characteristics

Parameter Count: 4 billion parameters, placing it in the medium-sized LLM category.
Context Length: Features a significant context window of 32768 tokens, allowing it to process and generate longer sequences of text.
Training Methodology: Utilizes SFT and DPO, suggesting an instruction-following capability and improved response quality.

Limitations

The provided model card indicates that specific details regarding its development, funding, model type, language(s), license, and finetuning base are currently "More Information Needed." Consequently, its precise capabilities, intended direct and downstream uses, and potential biases or risks are not explicitly documented. Users should exercise caution and conduct further evaluation before deploying this model in sensitive applications.

Overview

Model Overview

Key Characteristics

Limitations

Full Model Card (README)