TheDrummer/Gemma-3-R1-4B-v1
Vision | Concurrency Cost: 1 | Model Size: 4.3B | Quant: BF16 | Ctx Length: 32k | Published: Aug 7, 2025 | Architecture: Transformer

TheDrummer/Gemma-3-R1-4B-v1 is a 4.3-billion-parameter Gemma 3 R1 model developed by TheDrummer, with a 32,768-token context length. It is fine-tuned for stronger reasoning and reduced positivity in its responses. The model is vision-capable, which gives it multimodal potential that is unusual at this size. Its primary strength is creative, distinctive prose, with more depth than is typical for a 4B model.
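
As a rough guide to what the listed size and precision mean for hardware, the sketch below estimates the memory needed to hold the weights alone. It assumes 2 bytes per parameter for BF16, which is the standard width of that format. The function name `weight_memory_gib` and the lookup table are illustrative and are not part of the model or of any library. The estimate excludes activations, the KV cache for the 32,768-token context, and any vision-encoder overhead, so real usage will be higher.

```python
# Hypothetical helper: estimate weight storage from parameter count and precision.
# Covers weights only; activations and KV cache are not included.
BYTES_PER_PARAM = {"BF16": 2, "FP16": 2, "FP32": 4, "INT8": 1}


def weight_memory_gib(num_params: float, quant: str = "BF16") -> float:
    """Return the approximate size of the weights in GiB."""
    return num_params * BYTES_PER_PARAM[quant] / 2**30


if __name__ == "__main__":
    # 4.3B parameters in BF16, as listed for this model.
    print(f"{weight_memory_gib(4.3e9, 'BF16'):.2f} GiB")
```

At 4.3B parameters in BF16 this comes to roughly 8 GiB for the weights, which is a lower bound on the memory a deployment would need.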
