szkiM/Gemma12B-DPO_RSFT1
VISION | Concurrency Cost: 1 | Model Size: 12B | Quant: FP8 | Ctx Length: 32k | Published: Feb 14, 2026 | Architecture: Transformer | Cold
szkiM/Gemma12B-DPO_RSFT1 is a 12-billion-parameter language model, likely based on the Gemma architecture, with a context length of 32,768 tokens. The model has undergone DPO (Direct Preference Optimization) and RSFT (Reinforced Supervised Fine-Tuning), indicating a focus on aligning its outputs with human preferences and improving instruction following. Its parameter count and context window suggest it is suited to complex language understanding and generation tasks.
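For readers unfamiliar with DPO, the following is a minimal sketch of the standard per-example DPO objective. It is not this model's training code: the function name `dpo_loss`, the argument names, and the default `beta=0.1` are illustrative assumptions, and the inputs are assumed to be summed log-probabilities of a chosen and a rejected response under the trained policy and a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss: -log(sigmoid(beta * (chosen_ratio - rejected_ratio))).

    All names and the beta default are hypothetical, chosen for illustration.
    """
    # How much more (in log space) the policy favors each response than the reference does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_logratio - rejected_logratio)
    # -log(sigmoid(margin)) equals log(1 + exp(-margin)); branch to avoid overflow.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


if __name__ == "__main__":
    # Policy favors the chosen response more than the reference does, so the loss
    # falls below the log(2) value obtained when policy and reference agree.
    print(dpo_loss(-1.0, -3.0, -2.0, -2.0))
```

The loss shrinks as the policy raises the likelihood of the preferred response relative to the reference model, which is the sense in which DPO aligns outputs with preferences.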