thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr

Text Generation · Concurrency Cost: 1 · Model Size: 8B · Quant: FP8 · Ctx Length: 32k · Published: Feb 5, 2026 · Architecture: Transformer

thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr is an 8-billion-parameter instruction-tuned language model, likely based on the Llama 3.1 architecture. It is fine-tuned for mathematical reasoning and problem-solving, specifically on the GSM8K dataset. Its primary strength is accurately solving grade-school math word problems, making it suitable for applications that require numerical and logical inference.


Model Overview

thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr is an 8-billion-parameter instruction-tuned language model. The model card does not document its development process or base architecture, but the naming convention suggests it is derived from the Llama 3.1 8B Instruct series and further fine-tuned on GSM8K, with the "Rlvr" suffix hinting at Reinforcement Learning with Verifiable Rewards (RLVR).

Key Capabilities

  • Mathematical Reasoning: The model is specifically fine-tuned for performance on the GSM8K dataset, indicating a strong capability in solving grade-school level mathematical word problems.
  • Instruction Following: As an "Instruct" model, it is designed to follow user instructions effectively, making it suitable for conversational agents or task-oriented applications where clear directives are given.
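The capabilities above translate directly into a chat-style workflow. Below is a minimal sketch of querying the model with the standard `transformers` text-generation pipeline; it assumes the model is downloadable from the Hugging Face Hub under this ID and follows the usual Llama 3.1 Instruct chat conventions (the system prompt wording here is illustrative, not from the model card):

```python
def build_messages(problem: str) -> list[dict]:
    """Wrap a word problem in the chat format that Instruct models expect."""
    return [
        {"role": "system", "content": (
            "Solve the math word problem step by step. "
            "End with the final numeric answer."
        )},
        {"role": "user", "content": problem},
    ]


def solve(problem: str) -> str:
    """Run one GSM8K-style problem through the model (downloads ~8B weights)."""
    from transformers import pipeline  # deferred: heavy import

    generator = pipeline(
        "text-generation",
        model="thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr",
    )
    out = generator(build_messages(problem), max_new_tokens=512)
    # The pipeline returns the full message list; the last entry is the reply.
    return out[0]["generated_text"][-1]["content"]
```

An FP8-quantized 8B model fits comfortably on a single 24 GB GPU, which is part of the appeal of small specialized models like this one.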

Use Cases

  • Educational Tools: Ideal for applications assisting with mathematical homework, tutoring, or generating practice problems.
  • Logical Problem Solving: Can be applied to tasks requiring step-by-step numerical and logical inference.
  • Research and Development: Useful for researchers exploring the capabilities of smaller, specialized instruction-tuned models on specific reasoning tasks.