ericrisco/gemma-3-4b-reasoning
VISION | Concurrency Cost: 1 | Model Size: 4.3B | Quant: BF16 | Ctx Length: 32k | Published: Mar 13, 2025 | License: apache-2.0 | Architecture: Transformer | 0.0K | Open Weights | Cold
ericrisco/gemma-3-4b-reasoning is a 4.3-billion-parameter transformer-based language model fine-tuned by Eric Risco using GRPO (Group Relative Policy Optimization), following the DeepSeek-R1 methodology. It is optimized for structured, logical problem-solving and mathematical reasoning, particularly on GSM8K-style problems. The model is designed for multi-step problem solving and instruction-based reasoning, and produces Chain-of-Thought output.
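For readers unfamiliar with GRPO, the sketch below illustrates its core idea: several completions are sampled for the same prompt, each is scored, and each score is normalized against the mean and standard deviation of its own group. The reward function here is a hypothetical exact-match check on a GSM8K-style final answer (text after `####`). The card does not say which rewards were actually used in training, so `exact_match_reward`, `group_relative_advantages`, and the sample strings are illustrative assumptions, not the author's pipeline.

```python
import statistics


def exact_match_reward(completion: str, reference: str) -> float:
    """Hypothetical reward: 1.0 if the text after the last '####' equals the reference."""
    if "####" not in completion:
        return 0.0  # no final-answer marker, so nothing to compare
    predicted = completion.rsplit("####", 1)[-1].strip()
    return 1.0 if predicted == reference.strip() else 0.0


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group of completions sampled for the same prompt."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)  # population std over the group
    return [(r - mean) / (std + eps) for r in rewards]


# Four made-up completions for one prompt whose reference answer is "18".
completions = [
    "16 - 3 - 4 = 9 eggs, 9 * 2 = 18 #### 18",
    "16 - 3 = 13, 13 * 2 = 26 #### 26",
    "She sells 9 eggs at $2 each #### 18",
    "The answer is probably 18",  # correct value but missing the marker
]
rewards = [exact_match_reward(c, "18") for c in completions]
advantages = group_relative_advantages(rewards)

for reward, advantage in zip(rewards, advantages):
    print(f"reward={reward:.1f} advantage={advantage:+.3f}")
```

Completions that score above their group's mean get a positive advantage and those below get a negative one. When every completion in a group scores the same, all advantages are zero and that group contributes no learning signal.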