Name: jaredfern/canoe-1_1 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: jaredfern

Model Overview

jaredfern/canoe-1_1 is an 8 billion parameter instruction-tuned language model, built upon the robust foundation of meta-llama/Llama-3.1-8B-Instruct. This model distinguishes itself through its specialized training methodology.

Key Capabilities

Enhanced Mathematical Reasoning: The model was fine-tuned using the GRPO (Gradient-based Reward Policy Optimization) method, a technique highlighted in the DeepSeekMath paper. This training approach is specifically designed to improve the model's ability to handle complex mathematical problems and reasoning tasks.
Instruction Following: As an instruction-tuned model, it is adept at understanding and executing user prompts, making it suitable for a variety of conversational and task-oriented applications.
Extended Context Window: It supports a substantial context length of 32768 tokens, allowing it to process and generate longer, more coherent responses based on extensive input.

Training Details

The fine-tuning process utilized the TRL (Transformer Reinforcement Learning) framework. The GRPO method, central to its training, aims to push the limits of mathematical reasoning in open language models, suggesting a strong focus on accuracy and logical consistency in numerical and analytical tasks.

Good For

Applications requiring strong mathematical problem-solving.
Tasks that benefit from advanced reasoning capabilities.
Use cases where understanding and responding to complex instructions is critical.

Overview

Model Overview

Key Capabilities

Training Details

Good For

Full Model Card (README)