cs-552-2026-OAAA/math_model

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kPublished:May 18, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The cs-552-2026-OAAA/math_model is a 1.7 billion parameter causal language model from the Qwen3 series, developed by Qwen. It features a unique capability to seamlessly switch between a 'thinking mode' for complex logical reasoning, mathematics, and coding, and a 'non-thinking mode' for efficient general-purpose dialogue. With a context length of 32,768 tokens, this model is optimized for enhanced reasoning, instruction-following, and agent capabilities across various scenarios.

Loading preview...

Qwen3-1.7B: Adaptive Reasoning and Multilingual LLM

Qwen3-1.7B is a 1.7 billion parameter causal language model from the Qwen series, designed for advanced reasoning and flexible application. Its core innovation lies in its ability to dynamically switch between two operational modes:

Key Capabilities & Differentiators

  • Adaptive Thinking Modes: Uniquely supports seamless switching between a 'thinking mode' for complex logical reasoning, mathematics, and coding, and a 'non-thinking mode' for efficient, general-purpose dialogue. This ensures optimal performance tailored to the task.
  • Enhanced Reasoning: Demonstrates significant improvements in mathematical problem-solving, code generation, and commonsense logical reasoning, surpassing previous Qwen models.
  • Superior Human Alignment: Excels in creative writing, role-playing, multi-turn dialogues, and instruction following, providing a more natural and engaging conversational experience.
  • Advanced Agent Capabilities: Offers robust tool-calling abilities, integrating precisely with external tools in both thinking and non-thinking modes, achieving leading performance in complex agent-based tasks among open-source models.
  • Extensive Multilingual Support: Supports over 100 languages and dialects, with strong capabilities for multilingual instruction following and translation.
  • Long Context Window: Features a substantial context length of 32,768 tokens, allowing for processing and generating longer, more complex texts.

Best Practices for Optimal Performance

  • Sampling Parameters: Specific Temperature, TopP, TopK, and MinP settings are recommended for each mode (e.g., Temperature=0.6 for thinking mode, 0.7 for non-thinking mode) to prevent performance degradation and endless repetitions.
  • Adequate Output Length: Recommends an output length of 32,768 tokens for most queries, and up to 38,912 tokens for highly complex problems like math and programming competitions.
  • Standardized Output: Suggests using specific prompts to standardize outputs for benchmarking, especially for math problems and multiple-choice questions.