SantiagoC/palindrome-curriculum-v2
SantiagoC/palindrome-curriculum-v2 is a 0.8 billion parameter language model fine-tuned from SantiagoC/palindrome-sft-qwen3 using GRPO, a reinforcement-learning method designed to enhance mathematical reasoning. The model is optimized for tasks requiring robust reasoning, particularly in mathematical contexts.
Model Overview
SantiagoC/palindrome-curriculum-v2 is a 0.8 billion parameter language model developed by SantiagoC. It is a fine-tuned iteration of the SantiagoC/palindrome-sft-qwen3 base model, trained using the GRPO (Group Relative Policy Optimization) method. This training approach, introduced in the paper "DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models," focuses on improving mathematical reasoning.
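In brief, GRPO samples a group of G completions per prompt and scores each against the group rather than against a learned value model. As a sketch (following the DeepSeekMath paper; the exact normalization used to train this particular model is not documented here), the advantage of the i-th completion with reward r_i is:

```
A_i = ( r_i - mean(r_1, ..., r_G) ) / std(r_1, ..., r_G)
```

Completions that outperform their group receive positive advantages and are reinforced; below-average completions are penalized, all without a separate critic network.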
Key Characteristics
- Base Model: Fine-tuned from SantiagoC/palindrome-sft-qwen3.
- Training Method: Utilizes GRPO, a technique aimed at enhancing reasoning abilities, particularly in mathematical domains.
- Context Length: Supports a context length of 32768 tokens.
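To make the group-relative scoring concrete, here is a minimal, illustrative sketch of how GRPO-style advantages can be computed for one prompt's sampled completions. This is not the training code used for this model; the function name and the choice of population standard deviation are assumptions for illustration.

```python
from statistics import mean, pstdev

def group_advantages(rewards):
    """Normalize each completion's reward against its group's mean and std.

    `rewards` holds one scalar reward per sampled completion of a single
    prompt. Returns a group-relative advantage per completion.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards) or 1.0  # guard against a zero-variance group
    return [(r - mu) / sigma for r in rewards]

# Example: four sampled completions, two rewarded, two not.
advantages = group_advantages([1.0, 0.0, 1.0, 0.0])
print(advantages)  # [1.0, -1.0, 1.0, -1.0]
```

Completions with above-average reward get positive advantages, so the policy gradient pushes the model toward them; a uniform group yields all-zero advantages and no update signal.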
Intended Use
This model is suitable for applications requiring strong reasoning capabilities, especially those involving mathematical problem-solving or logical deduction, due to its specialized GRPO training.