hyunseoki/verl-math-transfer-7bi-to-3bi-fix03

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Mar 27, 2026Architecture:Transformer Warm

The hyunseoki/verl-math-transfer-7bi-to-3bi-fix03 model is a 7.6 billion parameter Qwen2ForCausalLM architecture developed by hyunseoki, specifically designed for mathematical transfer learning experiments. This model represents a transfer from a 7B to a 3B configuration, focusing on improving mathematical reasoning capabilities. It is primarily intended for research and development in the domain of mathematical problem-solving and model scaling.

Loading preview...

Overview

The hyunseoki/verl-math-transfer-7bi-to-3bi-fix03 model is a 7.6 billion parameter language model built on the Qwen2ForCausalLM architecture. It is the result of a math transfer experiment conducted using the verl framework, specifically designed to explore the transfer of mathematical capabilities from a larger 7 billion parameter model to a smaller 3 billion parameter configuration.

This repository contains various exported Hugging Face checkpoints for the 7B-to-3B fix_0_3 configuration, with the main branch currently pointing to the step-130 checkpoint. Users can load specific checkpoint revisions, such as step-010 through step-130, to analyze the progression of the transfer learning process.

Key Characteristics

  • Architecture: Qwen2ForCausalLM.
  • Parameter Count: 7.6 billion parameters.
  • Purpose: Focused on mathematical transfer learning experiments.
  • Checkpoints: Provides multiple step revisions, allowing for granular analysis of training progress.
  • Export Format: Checkpoints are exported from verl FSDP shards into Hugging Face safetensors format.

Intended Use Cases

This model is particularly suited for:

  • Research in mathematical reasoning: Investigating how mathematical knowledge transfers between models of different sizes.
  • Experimentation with model scaling: Studying the effects of down-scaling models while retaining specific capabilities.
  • Development of math-focused LLMs: As a base or reference for further fine-tuning on mathematical tasks.