didula-wso2/gemma4_sft-ballerina_klge_easysft_16bit_vllm

VISIONConcurrency Cost:1Model Size:7.9BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Jun 22, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The didula-wso2/gemma4_sft-ballerina_klge_easysft_16bit_vllm is a 7.9 billion parameter Gemma 4 model, fine-tuned by didula-wso2. This model was trained using Unsloth and Huggingface's TRL library, enabling faster fine-tuning. It is designed for general language understanding and generation tasks, leveraging its large context window of 32768 tokens for complex prompts.

Loading preview...

Model Overview

The didula-wso2/gemma4_sft-ballerina_klge_easysft_16bit_vllm is a fine-tuned Gemma 4 model, developed by didula-wso2. This model, with approximately 7.9 billion parameters and a substantial context length of 32768 tokens, is built upon the unsloth/gemma-4-E4B-it base.

Key Characteristics

  • Architecture: Based on the Gemma 4 family, known for its strong performance in various language tasks.
  • Efficient Fine-tuning: The model was fine-tuned using Unsloth and Huggingface's TRL library, which facilitated a 2x faster training process.
  • Large Context Window: Features a 32768-token context length, allowing it to process and generate longer, more complex sequences of text.

Potential Use Cases

This model is suitable for a range of applications requiring robust language understanding and generation, particularly where the benefits of a large context window are valuable. Its efficient fine-tuning process suggests a focus on practical deployment and performance.