shemilk/gemma-3-12b-merged-m-e-h
VISIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Feb 16, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The shemilk/gemma-3-12b-merged-m-e-h is a 12 billion parameter instruction-tuned causal language model developed by shemilk. This model is finetuned from unsloth/gemma-3-12b-it-unsloth-bnb-4bit and was trained using Unsloth and Huggingface's TRL library, enabling 2x faster training. It is designed for general language generation tasks, leveraging its Gemma 3 architecture and 32768 token context length.

Loading preview...