Neelectric/Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.08

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:May 19, 2026Architecture:Transformer Warm

Neelectric/Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.08 is an 8 billion parameter instruction-tuned model developed by Neelectric, fine-tuned from meta-llama/Llama-3.1-8B-Instruct. This model is specifically fine-tuned on a mathematical dataset, making it optimized for mathematical reasoning and problem-solving tasks. It leverages a 32768 token context length, enhancing its ability to handle complex mathematical prompts.

Loading preview...

Model Overview

Neelectric/Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.08 is an 8 billion parameter instruction-tuned language model, fine-tuned by Neelectric from the base meta-llama/Llama-3.1-8B-Instruct architecture. This model has been specifically trained using Supervised Fine-Tuning (SFT) on the extensive Neelectric/OpenR1-Math-220k_all_Llama3_4096toks dataset, which focuses on mathematical problems.

Key Capabilities

  • Enhanced Mathematical Reasoning: Optimized for understanding and solving mathematical problems due to its specialized fine-tuning dataset.
  • Instruction Following: Retains strong instruction-following capabilities from its Llama-3.1-8B-Instruct base.
  • Context Handling: Benefits from a 32768 token context length, allowing for processing longer and more complex mathematical prompts.

Good For

  • Mathematical Problem Solving: Ideal for applications requiring accurate mathematical computations, derivations, and explanations.
  • Educational Tools: Suitable for developing AI tutors or systems that assist with math homework and learning.
  • Research in Mathematical AI: A strong baseline for further research and development in AI models focused on quantitative reasoning.

This model was trained using the TRL library, ensuring a robust fine-tuning process.