malteos/german-r1

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kPublished:Feb 12, 2025Architecture:Transformer0.0K Warm

malteos/german-r1 is a German reasoning model based on Qwen2.5-3B-Instruct, fine-tuned specifically for mathematical reasoning tasks in German. It utilizes a German subset of the gsm8kx dataset and is designed to provide structured reasoning and answers in German, similar to OpenAI's o3 or DeepSeek's R1 models. This model is optimized for generating detailed, step-by-step explanations and numerical answers for arithmetic problems.

Loading preview...

German-R1: A German Reasoning Model

malteos/german-r1 is a specialized language model developed by malteos, focusing on reasoning capabilities in German. It is built upon the Qwen2.5-3B-Instruct base model, indicating a foundation in a robust instruction-tuned architecture.

Key Capabilities

  • German Reasoning: The primary strength of German-R1 is its ability to perform and articulate reasoning processes in German. It aims to provide performance comparable to models like OpenAI's o3 or DeepSeek's R1, but with a specific focus on the German language.
  • Structured Output: The model is designed to output responses in a structured XML-like format, including <reasoning> and <answer> tags, which is beneficial for parsing and evaluating its problem-solving steps.
  • Mathematical Problem Solving: It has been fine-tuned using a German subset of the openGPT-X/gsm8kx dataset, which consists of machine-translated mathematical word problems. This training makes it particularly adept at arithmetic and logical reasoning tasks.

Training Details

The model's training involved using the Qwen2.5-3B-Instruct as its base. The fine-tuning process was based on a GRPO (Generalized Reinforcement Learning from Human Feedback) demo, incorporating language identification reward to enhance its German reasoning abilities.

Good For

  • German-speaking applications requiring mathematical or logical reasoning.
  • Educational tools or platforms that need to generate step-by-step solutions to problems in German.
  • Developers looking for a specialized model for structured reasoning output in German.