Neelectric/Llama-3.1-8B-Instruct_SDFT_mathv00.01 Overview
This model is an 8-billion-parameter instruction-tuned language model developed by Neelectric, fine-tuned from meta-llama/Llama-3.1-8B-Instruct. Its primary differentiator is its specialized fine-tuning for mathematical tasks.
Key Capabilities & Training
- Mathematical Reasoning: The model has been fine-tuned on the Neelectric/OpenR1-Math-220k_all_Llama3_2048toks_SDFT dataset, indicating a strong focus on mathematical problem-solving and understanding.
- SDFT Training Method: It leverages the Self-Training with On-Policy Self-Distillation (SDFT) method, as introduced in the paper "Self-Training with On-Policy Self-Distillation for Language Model Alignment" (arXiv:2601.19897). This method aims to enhance model alignment and performance through self-distillation.
- TRL Framework: The fine-tuning process was conducted using the TRL (Transformer Reinforcement Learning) library, a framework for post-training large language models.
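The TRL workflow the training points above build on can be sketched as follows. This is a hedged illustration only: the card does not publish the actual training recipe or the SDFT-specific loop, so the hyperparameters in `make_sft_kwargs` are placeholder assumptions (only the 2048-token limit is implied by the dataset name).

```python
# Illustrative TRL fine-tuning setup; hyperparameters are assumptions,
# not the published recipe. Identifiers for the base model and dataset
# are taken from this card.

BASE_MODEL = "meta-llama/Llama-3.1-8B-Instruct"
DATASET = "Neelectric/OpenR1-Math-220k_all_Llama3_2048toks_SDFT"

def make_sft_kwargs(output_dir="llama31-sdft-math"):
    """Arguments one might pass to trl.SFTConfig. The 2048 sequence
    length matches the dataset name; everything else is a placeholder."""
    return {
        "output_dir": output_dir,
        "max_length": 2048,                 # per the dataset's token limit
        "per_device_train_batch_size": 2,   # placeholder
        "learning_rate": 2e-5,              # placeholder
    }

def train():
    # Requires `pip install trl datasets` plus access to the gated
    # Llama 3.1 weights; meant to show the shape of the API call.
    from datasets import load_dataset
    from trl import SFTConfig, SFTTrainer

    trainer = SFTTrainer(
        model=BASE_MODEL,
        args=SFTConfig(**make_sft_kwargs()),
        train_dataset=load_dataset(DATASET, split="train"),
    )
    trainer.train()
```

Note that plain SFT is only the substrate here; the SDFT method described above layers on-policy self-distillation on top of it.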
Use Cases
This model is particularly well-suited for applications requiring:
- Mathematical Problem Solving: Ideal for tasks involving arithmetic, algebra, geometry, and other mathematical reasoning.
- Educational Tools: Can be integrated into platforms for generating math explanations, solving problems, or creating quizzes.
- Research in Mathematical AI: Provides a strong baseline for further research and development in AI models focused on quantitative tasks.
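For the use cases above, a minimal inference sketch using the standard transformers text-generation pipeline might look like the following. The model id comes from this card; the system prompt and generation settings are illustrative assumptions.

```python
# Minimal usage sketch. The model id is from this card; the prompt
# wording and generation settings are illustrative, not prescribed.

MODEL_ID = "Neelectric/Llama-3.1-8B-Instruct_SDFT_mathv00.01"

def build_messages(problem):
    """Wrap a math problem in the chat format Llama 3.1 Instruct expects."""
    return [
        {"role": "system",
         "content": "You are a careful math tutor. Reason step by step."},
        {"role": "user", "content": problem},
    ]

def solve(problem, max_new_tokens=512):
    # Requires `pip install transformers accelerate` and enough memory
    # for an 8B model (roughly 16 GB of GPU memory in bf16).
    from transformers import pipeline
    generator = pipeline("text-generation", model=MODEL_ID,
                         torch_dtype="auto", device_map="auto")
    out = generator(build_messages(problem), max_new_tokens=max_new_tokens)
    # The pipeline returns the full chat; the last message is the reply.
    return out[0]["generated_text"][-1]["content"]
```

For educational or quiz-generation uses, only the system and user prompts need to change; the call shape stays the same.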