FinaPolat/llama3_1_8b_dpo-1k_ED
Task: Text generation
Concurrency cost: 1
Model size: 8B
Quantization: FP8
Context length: 32k
Published: Jan 21, 2026
License: apache-2.0
Architecture: Transformer (open weights)

FinaPolat/llama3_1_8b_dpo-1k_ED is an 8-billion-parameter Llama 3.1 model developed by FinaPolat and fine-tuned with Hugging Face's TRL library; training ran roughly twice as fast thanks to the Unsloth framework. It is a DPO (Direct Preference Optimization) fine-tune of FinaPolat/llama3_1_8b_sft-1k_ED, designed for efficient deployment and performance.
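A DPO stage of this kind typically pairs TRL's DPOTrainer with a preference dataset on top of the SFT checkpoint. The following is a minimal sketch of that setup, not the author's actual training script: the dataset name, hyperparameters, and output path are illustrative placeholders.

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

# Start from the SFT checkpoint that this model builds on.
model_id = "FinaPolat/llama3_1_8b_sft-1k_ED"
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Hypothetical preference dataset with "prompt", "chosen", and "rejected"
# columns; the data actually used for this model is not published here.
dataset = load_dataset("your-org/your-preference-pairs", split="train")

# Illustrative hyperparameters. When no ref_model is passed, TRL builds
# the implicit reference policy from a copy of the initial weights.
args = DPOConfig(output_dir="llama3_1_8b_dpo", beta=0.1, num_train_epochs=1)
trainer = DPOTrainer(
    model=model,
    args=args,
    train_dataset=dataset,
    processing_class=tokenizer,
)
trainer.train()
```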

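For deployment, the checkpoint can be loaded like any other Llama 3.1 model with transformers. A minimal generation sketch, assuming the repo keeps the base model's chat template and a GPU with bf16 support is available:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "FinaPolat/llama3_1_8b_dpo-1k_ED"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumes a GPU with bf16 support
    device_map="auto",
)

# Assumes the fine-tune keeps Llama 3.1's standard chat template.
messages = [{"role": "user", "content": "Explain DPO fine-tuning in two sentences."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=128)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```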