malteos/german-r1

Warm
Public
3.1B
BF16
32768
Hugging Face
Overview

German-R1: A German Reasoning Model

malteos/german-r1 is a specialized language model developed by malteos, focusing on reasoning capabilities in German. It is built upon the Qwen2.5-3B-Instruct base model, indicating a foundation in a robust instruction-tuned architecture.

Key Capabilities

  • German Reasoning: The primary strength of German-R1 is its ability to perform and articulate reasoning processes in German. It aims to provide performance comparable to models like OpenAI's o3 or DeepSeek's R1, but with a specific focus on the German language.
  • Structured Output: The model is designed to output responses in a structured XML-like format, including <reasoning> and <answer> tags, which is beneficial for parsing and evaluating its problem-solving steps.
  • Mathematical Problem Solving: It has been fine-tuned using a German subset of the openGPT-X/gsm8kx dataset, which consists of machine-translated mathematical word problems. This training makes it particularly adept at arithmetic and logical reasoning tasks.

Training Details

The model's training involved using the Qwen2.5-3B-Instruct as its base. The fine-tuning process was based on a GRPO (Generalized Reinforcement Learning from Human Feedback) demo, incorporating language identification reward to enhance its German reasoning abilities.

Good For

  • German-speaking applications requiring mathematical or logical reasoning.
  • Educational tools or platforms that need to generate step-by-step solutions to problems in German.
  • Developers looking for a specialized model for structured reasoning output in German.