TianHongZXY/CHIMERA-4B-SFT
TianHongZXY/CHIMERA-4B-SFT is a 4 billion parameter language model, fine-tuned from Qwen3-4B-Thinking-2507 via supervised fine-tuning (SFT) on the CHIMERA dataset. The fine-tuning yields measurable gains on reasoning benchmarks such as GPQA-D and HLE, reflecting improved reasoning and problem-solving ability. With a context length of 32768 tokens, it is suited to tasks requiring extended analytical inputs.
CHIMERA-4B-SFT Overview
CHIMERA-4B-SFT is a 4 billion parameter language model developed by TianHongZXY. It is a supervised fine-tuned (SFT) version of the Qwen3-4B-Thinking-2507 base model, utilizing the specialized CHIMERA dataset.
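As a minimal sketch, the model can be loaded with the standard Hugging Face `transformers` AutoModel API; the dtype and device-placement settings below are common defaults, not values confirmed by the model card:

```python
# Minimal loading sketch, assuming the standard transformers AutoModel API.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TianHongZXY/CHIMERA-4B-SFT"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",  # use the dtype stored in the checkpoint
    device_map="auto",   # place the 4B weights on available GPU(s)/CPU
)
```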
Key Capabilities & Performance
This model posts notable gains over its base model, Qwen3-4B-Thinking-2507, as a direct result of the SFT process:
| Benchmark | Qwen3-4B-Thinking-2507 | CHIMERA-4B-SFT |
|-----------|------------------------|----------------|
| GPQA-D | 65.8% | 68.8% |
| AIME 24 | 81.6% | 86.5% |
| HMMT Feb 25 | 59.2% | 63.1% |
| HLE | 7.3 | 9.0 |
These results indicate enhanced reasoning and problem-solving ability, particularly on complex academic and mathematical tasks. SFT alone accounts for the majority of these gains; the CHIMERA-4B-RL variant adds further improvements through reinforcement learning (RL).
When to Use This Model
CHIMERA-4B-SFT is particularly well-suited for applications requiring strong analytical performance and improved accuracy on benchmarks related to general knowledge, mathematics, and logical reasoning. Its 32768-token context length supports processing longer inputs for these complex tasks.
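For illustration, a single-turn reasoning query might look like the sketch below, reusing the `model` and `tokenizer` from the loading sketch above. It assumes the repository ships a Qwen3-style chat template; the prompt and `max_new_tokens` value are illustrative, not recommendations from the model card:

```python
# Hypothetical usage sketch; assumes the tokenizer provides a chat template.
messages = [
    {"role": "user", "content": "If 3x + 7 = 22, what is x?"}
]

inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

# Thinking-style models can emit long reasoning traces, so leave room to generate.
outputs = model.generate(inputs, max_new_tokens=2048)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```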