Siheng99/Qwen3-1.7B-DeepMath-1024samples-RePO

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 6, 2025Architecture:Transformer Warm

Siheng99/Qwen3-1.7B-DeepMath-1024samples-RePO is a 2 billion parameter Qwen3-based language model developed by Siheng99. This model is specifically fine-tuned for mathematical tasks, leveraging 1024 samples with RePO training. It is designed to excel in mathematical reasoning and problem-solving within its 32768 token context window.

Loading preview...

Model Overview

This model, Siheng99/Qwen3-1.7B-DeepMath-1024samples-RePO, is a 2 billion parameter language model built upon the Qwen3 architecture. It has been developed by Siheng99 with a focus on enhancing mathematical capabilities.

Key Characteristics

  • Architecture: Based on the Qwen3 model family.
  • Parameter Count: 2 billion parameters, offering a balance between performance and computational efficiency.
  • Context Length: Supports a substantial context window of 32768 tokens, beneficial for complex mathematical problems requiring extensive context.
  • Specialized Training: Fine-tuned using 1024 samples with the RePO (Reinforced Policy Optimization) method, specifically targeting mathematical reasoning.

Intended Use Cases

This model is primarily intended for applications requiring strong mathematical problem-solving and reasoning. While specific benchmarks are not provided in the README, its specialized training suggests suitability for:

  • Solving mathematical equations and problems.
  • Assisting in mathematical research and analysis.
  • Educational tools for mathematics.

Limitations

The provided model card indicates that much information regarding its development, training data, evaluation, biases, risks, and specific use cases is currently "More Information Needed." Users should be aware that without further details, the full scope of its capabilities and potential limitations remains to be thoroughly assessed.