szkiM/Gemma12B-DPO_RSFT2
VISIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Feb 18, 2026Architecture:Transformer Cold

szkiM/Gemma12B-DPO_RSFT2 is a 12 billion parameter language model based on the Gemma architecture. This model has undergone DPO (Direct Preference Optimization) and RSFT2 fine-tuning, indicating an optimization for alignment with human preferences and specific task performance. Its primary strength lies in its fine-tuned nature, making it suitable for applications requiring nuanced response generation and adherence to desired output styles.

Loading preview...