TheDrummer/Gemma-3-R1-4B-v1

Hugging Face
VISIONConcurrency Cost:1Model Size:4.3BQuant:BF16Ctx Length:32kPublished:Aug 7, 2025Architecture:Transformer0.0K Warm

TheDrummer/Gemma-3-R1-4B-v1 is a 4.3 billion parameter Gemma 3 R1 model developed by TheDrummer, featuring a 32768-token context length. This model is specifically fine-tuned for enhanced reasoning capabilities and reduced positivity in its responses. It is designed to be vision-capable, offering advanced multimodal potential for its size. Its primary strength lies in generating creative and unique prose, demonstrating surprising depth for a 4B model.

Loading preview...

Model Overview

TheDrummer/Gemma-3-R1-4B-v1 is a 4.3 billion parameter model based on the Gemma 3 R1 architecture, developed by TheDrummer. It features an impressive 32768-token context length, allowing for extensive input processing. This version is a specialized reasoning tune, designed to unlock more advanced capabilities and exhibit less overtly positive biases in its outputs.

Key Capabilities

  • Enhanced Reasoning: The model is specifically tuned to improve its reasoning abilities, going beyond standard Gemma prose.
  • Vision Capable: It is designed with vision capabilities, suggesting potential for multimodal applications.
  • Creative and Unique Prose: Users have noted its ability to generate surprisingly deep, witty, and unique text, moving beyond typical LLM outputs.
  • Code Generation Potential: Anecdotal evidence suggests it can generate complex code structures, including full HTML, CSS, and JavaScript for interactive elements.

Usage Notes

To initiate assistant turns, users may need to prefill with <think>. The model's design allows for creative modifications of reasoning tags, such as <evil_think> or <creative_think>, as these are not special tokens.

Good For

  • Applications requiring advanced reasoning from a compact model.
  • Generating creative, witty, or unique textual content.
  • Exploring multimodal tasks due to its vision-capable design.
  • Use cases where a less overtly positive or more nuanced tone is desired.