mcwei/gemma-4-31B-it-bf16-sft-300

VISIONConcurrency Cost:2Model Size:31BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Jun 15, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The mcwei/gemma-4-31B-it-bf16-sft-300 is a 31 billion parameter instruction-tuned Gemma 4 model developed by mcwei. This model was fine-tuned using Unsloth and Huggingface's TRL library, achieving 2x faster training. It is designed for general instruction-following tasks, leveraging its large parameter count and efficient training methodology.

Loading preview...

Model Overview

The mcwei/gemma-4-31B-it-bf16-sft-300 is a 31 billion parameter instruction-tuned model based on the Gemma 4 architecture. Developed by mcwei, this model was fine-tuned from unsloth/gemma-4-31B-it using the Unsloth library and Huggingface's TRL library.

Key Characteristics

  • Architecture: Gemma 4, a powerful open-source model family.
  • Parameter Count: 31 billion parameters, indicating a strong capacity for complex tasks.
  • Training Efficiency: Fine-tuned with Unsloth, which enabled 2x faster training compared to standard methods.
  • Context Length: Supports a substantial context window of 32768 tokens, allowing for processing longer inputs and generating more coherent, extended responses.

Use Cases

This model is well-suited for a variety of instruction-following applications, benefiting from its large size and efficient fine-tuning. Its capabilities make it a strong candidate for:

  • General conversational AI.
  • Complex question answering.
  • Content generation requiring detailed and lengthy outputs.
  • Tasks where processing extensive context is crucial.