sarapatel/llama31-8b-grpo-gsm8k-run1
sarapatel/llama31-8b-grpo-gsm8k-run1 is an 8-billion-parameter instruction-tuned Llama 3.1 model developed by sarapatel. It was fine-tuned using Unsloth and Hugging Face's TRL library, enabling 2x faster training, and is designed for general language understanding and generation tasks, leveraging the Llama 3.1 architecture for robust performance.
Model Overview
sarapatel/llama31-8b-grpo-gsm8k-run1 is an 8-billion-parameter language model developed by sarapatel. It is fine-tuned from the unsloth/Meta-Llama-3.1-8B-Instruct base model and inherits the Llama 3.1 architecture.
Key Characteristics
- Architecture: Based on the Meta-Llama-3.1-8B-Instruct model.
- Training Efficiency: This model was fine-tuned using Unsloth and Hugging Face's TRL library, which enabled a 2x faster training process.
- License: Distributed under the Apache-2.0 license.
Intended Use Cases
This model is suitable for a variety of general-purpose language tasks, benefiting from the Llama 3.1 instruction-tuned base; given the run name, it is likely strongest on grade-school math word problems of the kind found in GSM8K. Its efficient training setup makes further fine-tuning and deployment practical on modest hardware.
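Because the base is a Llama 3.1 Instruct model, prompts should follow the Llama 3.1 chat format; in practice `tokenizer.apply_chat_template` produces this for you. A hand-rolled sketch of that format for a GSM8K-style question (the system message and question text are illustrative, and the template below is the standard Llama 3.1 layout rather than anything specific to this run):

```python
def build_llama31_prompt(system: str, user: str) -> str:
    """Assemble a Llama 3.1 chat prompt by hand.

    Normally tokenizer.apply_chat_template does this; shown here only to
    illustrate the expected structure of the input.
    """
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n" + system + "<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n" + user + "<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )


prompt = build_llama31_prompt(
    "You are a helpful math tutor. Show your reasoning step by step.",
    "A baker makes 24 rolls and sells them in bags of 6. How many bags?",
)
```

The prompt ends with an open assistant header, so generation continues as the assistant's reply.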