mrshu/qwen3-1.7b-dpo-newbase-bs6
TEXT GENERATION · Concurrency Cost: 1 · Model Size: 2B · Quant: BF16 · Ctx Length: 32k · Published: Apr 2, 2026 · Architecture: Transformer · Cold

mrshu/qwen3-1.7b-dpo-newbase-bs6 is a language model of roughly 2 billion parameters (listed as 2B), fine-tuned from Qwen/Qwen3-1.7B with Direct Preference Optimization (DPO). It targets general text generation, and the DPO stage is intended to align its outputs with human preferences and improve the quality and helpfulness of the generated text. The 32K-token context length suits applications that need coherent, contextually relevant responses over longer interactions.
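To make the DPO step more concrete, the following is a minimal sketch of the standard DPO objective for a single preference pair, written in plain Python. It is not taken from this model's training code, which the card does not describe. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions chosen for illustration. The inputs are summed log-probabilities of a preferred ("chosen") and a dispreferred ("rejected") response under the model being trained and under a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair.

    All arguments except beta are sequence log-probabilities.
    beta=0.1 is an illustrative default, not this model's setting.
    """
    # How much more the policy likes each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp

    # Scaled difference of the two margins.
    x = beta * (chosen_margin - rejected_margin)

    # loss = -log(sigmoid(x)) = log(1 + exp(-x)), computed stably.
    if x >= 0:
        return math.log1p(math.exp(-x))
    return -x + math.log1p(math.exp(x))


# Policy identical to the reference: the loss equals log(2).
baseline = dpo_loss(-10.0, -10.0, -10.0, -10.0)

# Policy shifts probability toward the chosen response: the loss drops.
improved = dpo_loss(-8.0, -12.0, -10.0, -10.0)
```

When the policy and the reference agree, the loss is log 2 (about 0.693). It falls below that value as the policy assigns relatively more probability to the chosen response than the reference does, and rises above it in the opposite case.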
