SnippyCodes/libratio-fleet-llama3-grpo
SnippyCodes/libratio-fleet-llama3-grpo is an 8-billion-parameter Llama 3.1 instruction-tuned causal language model developed by SnippyCodes. The model was fine-tuned using Unsloth together with Hugging Face's TRL library, enabling roughly 2x faster training. It is optimized for efficient deployment and performance, making it suitable for applications that need a capable yet resource-conscious LLM.
Model Overview
SnippyCodes/libratio-fleet-llama3-grpo is an 8-billion-parameter instruction-tuned large language model based on the Meta-Llama-3.1-8B-Instruct architecture. It was fine-tuned using a combination of Unsloth and Hugging Face's TRL library, a training setup that delivered roughly a 2x acceleration in the fine-tuning process.
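The model name suggests GRPO-style reinforcement fine-tuning via TRL on top of an Unsloth-loaded base. The sketch below shows how such a setup is typically wired; the reward function, hyperparameters, and dataset here are illustrative placeholders and are assumptions, not the actual recipe used for this model.

```python
# Illustrative sketch of a GRPO fine-tune with Unsloth + TRL.
# The reward function and all hyperparameters below are placeholders.
def length_reward(prompts, completions, **kwargs):
    # Toy reward: prefer completions close to 200 characters.
    return [-abs(len(c) - 200) / 200.0 for c in completions]

def build_trainer():
    # Heavy imports are deferred so the sketch stays cheap to read and load.
    from trl import GRPOConfig, GRPOTrainer
    from unsloth import FastLanguageModel

    model, tokenizer = FastLanguageModel.from_pretrained(
        "unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit",
        max_seq_length=2048,
        load_in_4bit=True,
    )
    args = GRPOConfig(
        output_dir="grpo-out",
        num_generations=4,          # completions sampled per prompt for group scoring
        max_completion_length=256,
    )
    return GRPOTrainer(
        model=model,
        reward_funcs=[length_reward],
        args=args,
        train_dataset=None,  # supply a dataset with a "prompt" column
    )
```

In GRPO, several completions are sampled per prompt and each is scored by the reward functions; advantages are computed relative to the group, which is why `num_generations` matters.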
Key Characteristics
- Base Model: Fine-tuned from unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit.
- Efficient Training: Utilizes Unsloth for significantly faster fine-tuning.
- Parameter Count: 8 billion parameters, offering a balance of capability and computational efficiency.
- License: Distributed under the Apache 2.0 license.
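Since this is an instruction-tuned checkpoint, a minimal inference sketch with Hugging Face transformers may be useful. The repo id below is taken from the model name; verify it matches the hosted model, and note that dtype and device settings here are assumptions.

```python
# Minimal inference sketch for the instruction-tuned checkpoint.
MODEL_ID = "SnippyCodes/libratio-fleet-llama3-grpo"  # assumed repo id

def generate(prompt: str, max_new_tokens: int = 256) -> str:
    # Imports are deferred so the sketch can be read without the libraries installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # bf16 keeps the 8B model within typical GPU memory
        device_map="auto",
    )
    # Llama 3.1 Instruct expects the chat template, not a raw prompt string.
    messages = [{"role": "user", "content": prompt}]
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    out = model.generate(inputs, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, skipping the echoed prompt.
    return tokenizer.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True)
```

Applying the chat template is important: Llama 3.1 Instruct models are trained on a specific turn format, and raw-string prompting degrades output quality.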
Good For
- Applications requiring a performant Llama 3.1-based model.
- Scenarios where efficient fine-tuning and deployment are critical.
- Developers looking for a robust instruction-tuned LLM with a permissive license.