FinaPolat/qwen3_8b_dpo-1k_ED
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Feb 13, 2026License:apache-2.0Architecture:Transformer Open Weights Cold
FinaPolat/qwen3_8b_dpo-1k_ED is an 8 billion parameter Qwen3 model developed by FinaPolat, fine-tuned from FinaPolat/qwen3_8b_sft-1k_ED. This model was trained using Unsloth and Huggingface's TRL library, achieving 2x faster training. It is designed for general language tasks, leveraging its efficient training methodology.
Loading preview...