TheHassanSaud/ramzan_sft_gemma3_with_updated_templat
Vision | Concurrency Cost: 1 | Model Size: 12B | Quant: FP8 | Ctx Length: 32k | Published: Jan 11, 2026 | License: other | Architecture: Transformer | Cold

TheHassanSaud/ramzan_sft_gemma3_with_updated_templat is a 12-billion-parameter language model fine-tuned by TheHassanSaud from ramzanniaz331/gemma3-12b-2048-v3. It supports a context length of 32,768 tokens and was specialized through supervised fine-tuning on a mix of datasets including ramzan_5k_batch_1, ramzan_5k_batch_2, ramzan_openhermes, ramzan_metamath, and ramzan_aya_urdu. It is intended for general language generation tasks, drawing on the combined coverage of its training data.
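The 32,768-token context length caps the prompt and the generated tokens together, so it can help to budget generation length before sending a request. The sketch below is only an illustration of that arithmetic: the helper name `max_new_tokens_for` and the constant `CONTEXT_LENGTH` are hypothetical and not part of any library or of this model's tooling, and the prompt token count is assumed to come from whatever tokenizer you use with the model.

```python
# Context length as stated in the model description (32k = 32 * 1024 tokens).
CONTEXT_LENGTH = 32768


def max_new_tokens_for(prompt_token_count: int,
                       context_length: int = CONTEXT_LENGTH) -> int:
    """Return how many tokens can still be generated after the prompt.

    Hypothetical helper: prompt tokens plus generated tokens are assumed
    to share a single context window of `context_length` tokens.
    """
    if prompt_token_count < 0:
        raise ValueError("prompt_token_count must be non-negative")
    # Clamp at zero when the prompt alone already fills or exceeds the window.
    return max(context_length - prompt_token_count, 0)


if __name__ == "__main__":
    # Example: a 30,000-token prompt leaves room for 2,768 new tokens.
    print(max_new_tokens_for(30_000))
```

A prompt longer than the window yields a budget of zero, which signals that the input needs to be truncated or summarized before generation.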
