Dnfs/gema-4b-indra10k-model1

Cold
Public
Vision
4.3B
BF16
32768
License: apache-2.0
Hugging Face
Overview

Overview

Dnfs/gema-4b-indra10k-model1 is a 4.3 billion parameter language model based on the Gemma 3 architecture. Developed by Dnfs, this model was fine-tuned using a combination of Unsloth and Hugging Face's TRL library, which facilitated a 2x faster training process. The model operates under an Apache-2.0 license.

Key Characteristics

  • Architecture: Based on the Gemma 3 model family.
  • Parameter Count: Features 4.3 billion parameters.
  • Training Efficiency: Utilizes Unsloth and Hugging Face TRL for accelerated fine-tuning.
  • Context Length: Supports a context window of 32768 tokens.

Good For

  • Applications requiring a moderately sized language model with efficient training origins.
  • General text generation and understanding tasks where the Gemma 3 architecture is suitable.
  • Developers interested in models fine-tuned with Unsloth for performance benefits.