genne/nhn_dpo_v3_T3Q-ko-solar-dpo-v3.0_DPO

Warm
Public
10.7B
FP8
4096
1
License: apache-2.0
Hugging Face

The genne/nhn_dpo_v3_T3Q-ko-solar-dpo-v3.0_DPO model is a fine-tuned version of the chihoonlee10/T3Q-ko-solar-dpo-v3.0 model. It was trained using a learning rate of 5e-07 and a cosine learning rate scheduler over 1 epoch. This model is optimized for tasks related to its base model, though specific differentiators and intended uses require further information.

No reviews yet. Be the first to review!