Intel/neural-chat-7b-v3-2

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Nov 21, 2023License:apache-2.0Architecture:Transformer0.1K Open Weights Cold

Intel/neural-chat-7b-v3-2 is a 7 billion parameter large language model developed by Intel, fine-tuned from Mistral-7B-v0.1. This model was further fine-tuned on the MetaMathQA dataset using Direct Performance Optimization (DPO) with Intel/orca_dpo_pairs, specifically enhancing its mathematical reasoning capabilities. With an 8192-token context length, it is optimized for language-related tasks, particularly those requiring step-by-step mathematical problem-solving.

Loading preview...

Model Overview

Intel/neural-chat-7b-v3-2 is a 7 billion parameter large language model developed by Intel, building upon the Mistral-7B-v0.1 architecture. It was fine-tuned on the Intel Gaudi 2 processor, leveraging the meta-math/MetaMathQA dataset and aligned using Direct Performance Optimization (DPO) with Intel/orca_dpo_pairs. This specific training regimen aims to enhance its performance in mathematical reasoning and problem-solving.

Key Capabilities

  • Mathematical Reasoning: Fine-tuned on MetaMathQA, the model excels at providing step-by-step solutions and explanations for math problems.
  • Instruction Following: Aligned using DPO, it is designed to follow instructions effectively for various language-related tasks.
  • Context Length: Supports an 8192-token context window, similar to its base model, Mistral-7B-v0.1.
  • Hardware Optimization: Developed and optimized for Intel Gaudi 2 processors, with options for inference on NVIDIA GPUs.

Performance Metrics

On the Open LLM Leaderboard, Intel/neural-chat-7b-v3-2 achieved an average score of 68.29, with notable scores including 67.49 on ARC (25-shot), 83.92 on HellaSwag (10-shot), and 55.12 on GSM8K (5-shot), indicating its general language understanding and specific math capabilities.

Good For

  • Applications requiring detailed, step-by-step mathematical problem-solving.
  • General language-related inference tasks where a 7B parameter model is suitable.
  • Developers working with Intel Gaudi 2 hardware seeking an optimized LLM.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p