heyalexchoi/qwen3-1.7b-math-sft-v2

TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 10, 2026Architecture:Transformer Cold

The heyalexchoi/qwen3-1.7b-math-sft-v2 model is a 1.7 billion parameter language model, fine-tuned from Qwen/Qwen3-1.7B-Base. It was trained using Supervised Fine-Tuning (SFT) with the TRL framework. This model is specifically optimized for mathematical reasoning and related tasks, building upon the Qwen3 architecture. It offers a 32768 token context length, making it suitable for processing longer mathematical problems and complex logical sequences.

Loading preview...

Model Overview

This model, heyalexchoi/qwen3-1.7b-math-sft-v2, is a specialized variant of the Qwen3-1.7B-Base architecture. It has undergone Supervised Fine-Tuning (SFT) using the TRL (Transformers Reinforcement Learning) framework, indicating a focus on improving specific task performance rather than broad instruction following.

Key Characteristics

  • Base Model: Fine-tuned from Qwen/Qwen3-1.7B-Base.
  • Parameter Count: Approximately 1.7 billion parameters.
  • Training Method: Utilizes Supervised Fine-Tuning (SFT) for targeted performance enhancement.
  • Frameworks: Developed with TRL, Transformers, PyTorch, Datasets, and Tokenizers.

Potential Use Cases

Given its fine-tuning approach and the base model's capabilities, this model is likely optimized for:

  • Mathematical Problem Solving: Excelling in tasks requiring numerical reasoning, equation solving, and logical deduction.
  • Technical Text Generation: Generating or understanding content related to scientific or engineering domains.
  • Specialized Q&A: Answering questions within specific technical or mathematical contexts.

Users can quickly integrate the model using the provided transformers pipeline for text generation tasks.