Name: cjiao/golden-goose-qwen2.5-1.5b-instruct-stratified-groups API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: cjiao

Model Overview

The cjiao/golden-goose-qwen2.5-1.5b-instruct-stratified-groups is a 1.5 billion parameter instruction-tuned language model, built upon the robust Qwen2.5-1.5B-Instruct architecture. This model distinguishes itself through its specialized training methodology.

Key Differentiator: GRPO Training

The primary innovation of this model lies in its training procedure. It was fine-tuned using GRPO (Grouped Reinforcement Learning with Policy Optimization), a method introduced in the paper "DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models". This technique is specifically designed to enhance the model's capabilities in mathematical reasoning tasks.

Technical Details

Base Model: Qwen/Qwen2.5-1.5B-Instruct
Parameter Count: 1.5 Billion
Context Length: 32768 tokens
Training Framework: TRL (Transformers Reinforcement Learning)

Potential Use Cases

Given its GRPO-enhanced training, this model is particularly well-suited for applications that involve:

Mathematical problem-solving: Tasks requiring logical deduction and numerical computation.
Reasoning-intensive queries: Scenarios where understanding and applying mathematical principles are crucial.
Instruction following: Benefiting from its instruction-tuned base, combined with improved reasoning for complex instructions.

Overview

Model Overview

Key Differentiator: GRPO Training

Technical Details

Potential Use Cases

Full Model Card (README)