Name: cjiao/goldengoose-high_div_rand_top-25grp API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: cjiao

Model Overview

The cjiao/goldengoose-high_div_rand_top-25grp is a 1.5 billion parameter language model, fine-tuned from the Qwen/Qwen2.5-1.5B-Instruct base model. It was developed by cjiao and trained using the TRL library.

Key Training Innovation

A significant aspect of this model is its training procedure, which incorporates GRPO (Gradient Regularized Policy Optimization). This method was introduced in the research paper "DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models". The application of GRPO suggests a focus on enhancing the model's capabilities in complex reasoning tasks, particularly in the domain of mathematics.

Technical Details

Base Model: Qwen/Qwen2.5-1.5B-Instruct
Parameter Count: 1.5 Billion
Context Length: 32768 tokens
Training Frameworks: TRL (0.19.1), Transformers (4.57.6), PyTorch (2.5.1), Datasets (4.8.4), Tokenizers (0.22.2)

Potential Use Cases

Given its fine-tuning with GRPO, this model is likely well-suited for:

Mathematical problem-solving: Tasks requiring logical deduction and numerical accuracy.
Reasoning-intensive applications: Scenarios where robust analytical capabilities are needed.
Instruction-following: Leveraging its base as an instruction-tuned model for specific tasks.

Overview

Model Overview

Key Training Innovation

Technical Details

Potential Use Cases

Full Model Card (README)