Name: cjiao/goldengoose-corr-v4-random-200 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: cjiao

Model Overview

The cjiao/goldengoose-corr-v4-random-200 is a 1.5 billion parameter language model, fine-tuned from the Qwen/Qwen2.5-1.5B-Instruct base model. Developed by cjiao, this model leverages a substantial 32768 token context window, making it capable of processing longer inputs and maintaining coherence over extended interactions.

Key Differentiator: GRPO Training

A significant aspect of this model is its training methodology. It was fine-tuned using GRPO (Gradient-based Reasoning Policy Optimization), a method introduced in the research paper "DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models." This specialized training approach aims to enhance the model's capabilities in:

Mathematical Reasoning: Improving its ability to understand and solve complex mathematical problems.
Logical Inference: Strengthening its capacity for structured and accurate reasoning.

Technical Details

The model's training utilized the TRL framework (version 0.19.1) alongside Transformers (4.57.6), Pytorch (2.5.1), Datasets (4.8.4), and Tokenizers (0.22.2).

Use Cases

Given its GRPO-enhanced training, this model is particularly well-suited for applications requiring robust mathematical and logical reasoning, such as:

Solving mathematical word problems.
Generating logical explanations or proofs.
Tasks where precise, step-by-step reasoning is critical.

Overview

Model Overview

Key Differentiator: GRPO Training

Technical Details

Use Cases

Full Model Card (README)