cjiao/goldengoose-corr-v2-random-100
Text Generation · Concurrency Cost: 1 · Model Size: 1.5B · Quant: BF16 · Ctx Length: 32k · Published: Apr 25, 2026 · Architecture: Transformer
The cjiao/goldengoose-corr-v2-random-100 is a 1.5 billion parameter instruction-tuned causal language model, fine-tuned from Qwen/Qwen2.5-1.5B-Instruct. It was trained using the TRL framework and incorporates the GRPO method, which is designed to enhance mathematical reasoning capabilities. This model is optimized for generating coherent and contextually relevant text, particularly in response to complex prompts, and supports a context length of 32768 tokens.
Model Overview
The cjiao/goldengoose-corr-v2-random-100 is a 1.5 billion parameter language model, fine-tuned from the Qwen/Qwen2.5-1.5B-Instruct base model. It leverages the TRL (Transformer Reinforcement Learning) framework for its training process.
Key Capabilities
- Enhanced Reasoning: This model was trained using the GRPO (Group Relative Policy Optimization) method, introduced in the DeepSeekMath paper, which suggests it is optimized for improved reasoning, particularly in mathematical contexts.
- Instruction Following: As an instruction-tuned model, it is designed to understand and respond effectively to user prompts and instructions.
- Context Handling: Supports a substantial context length of 32768 tokens, allowing for processing and generating longer, more complex texts while maintaining coherence.
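To make the GRPO training signal concrete, here is a minimal sketch of its core idea from the DeepSeekMath paper: for each prompt, a group of completions is sampled, and each completion's advantage is its reward normalized against the group's mean and standard deviation. This is an illustration of the technique, not this model's actual training code; the reward values are made up.

```python
# Sketch of GRPO's group-relative advantage computation.
# Rewards here are hypothetical (1.0 = correct answer, 0.0 = incorrect).
from statistics import mean, stdev

def group_relative_advantages(rewards: list[float]) -> list[float]:
    """Normalize each completion's reward against its group's statistics."""
    mu = mean(rewards)
    sigma = stdev(rewards) + 1e-8  # small epsilon guards against a zero-variance group
    return [(r - mu) / sigma for r in rewards]

# A group of 4 sampled answers to one math prompt: two correct, two wrong.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
# Correct answers get positive advantages, incorrect ones negative,
# so the policy is pushed toward the better completions in each group.
```

Because the baseline comes from the group itself, GRPO needs no separate value model, which is part of what makes it attractive for reasoning-focused fine-tuning.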
Good For
- Mathematical Reasoning Tasks: The application of the GRPO method indicates potential strengths in tasks requiring logical and mathematical problem-solving.
- General Text Generation: Suitable for a wide range of text generation tasks where instruction following and coherent output are important.
- Research and Experimentation: Provides a fine-tuned model based on a robust foundation, ideal for further research into instruction tuning and reasoning enhancements.
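A usage sketch with the Hugging Face transformers library is below. The model id comes from this card; the prompt and sampling parameters are illustrative assumptions, since the card does not publish recommended generation settings.

```python
# Hypothetical inference example for cjiao/goldengoose-corr-v2-random-100.
model_id = "cjiao/goldengoose-corr-v2-random-100"

# Instruction-tuned models expect chat-formatted input.
messages = [
    {"role": "user", "content": "Solve: if 3x + 5 = 20, what is x?"},
]

if __name__ == "__main__":
    # Heavy imports and the model download happen only when run as a script.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="bfloat16")

    # apply_chat_template formats the messages the way the model was trained on.
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    )
    outputs = model.generate(inputs, max_new_tokens=256, do_sample=False)
    # Decode only the newly generated tokens, skipping the prompt.
    print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

Since the model inherits Qwen2.5's 32768-token context window, long multi-step prompts fit in a single call without manual chunking.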