Name: modrill/math_think_11_qwen3_4b_base_task_arithmetic_scaling_0_6 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: modrill

Overview

This model, modrill/math_think_11_qwen3_4b_base_task_arithmetic_scaling_0_6, is a 4 billion parameter language model built upon the Qwen3-4B-Base architecture. It is created using a task arithmetic merge method, which combines two models to achieve specific performance characteristics. The merge specifically targets enhancing mathematical reasoning.

Key Capabilities

Mathematical Reasoning: The model is a result of merging a fine-tuned Qwen3-4B-Base model (specifically math_think_11_qwen3_4b_base_sft) with the original Qwen/Qwen3-4B-Base.
Task Arithmetic: It utilizes the task arithmetic formula theta = theta_base + scaling * (theta_sft - theta_base) with a scaling coefficient of 0.6 to integrate the specialized mathematical capabilities.

Good For

Arithmetic Tasks: This model is particularly suited for applications requiring improved performance on arithmetic and mathematical reasoning problems, due to its specialized merging approach.
Leveraging Qwen3-4B-Base: Users already familiar with or utilizing the Qwen3-4B-Base architecture can benefit from this version's enhanced mathematical focus.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)