wvnvwn/llama-2-13b-chat-hf-lr5e-5-resta-0.1

TEXT GENERATIONConcurrency Cost:1Model Size:13BQuant:FP8Ctx Length:4kPublished:Apr 30, 2026Architecture:Transformer Cold

The wvnvwn/llama-2-13b-chat-hf-lr5e-5-resta-0.1 model is a 13 billion parameter language model based on the Llama-2-13b-chat-hf architecture, created by wvnvwn through a linear merge using Mergekit. This model integrates components from specialized Llama-2 variants, including one fine-tuned for GSM8K and another with SSFT, alongside the base Meta Llama-2-13b-chat-hf. It is designed to leverage the combined strengths of its constituent models, potentially offering enhanced performance in areas like mathematical reasoning and general chat capabilities within its 4096-token context window.

Loading preview...

Model Overview

The wvnvwn/llama-2-13b-chat-hf-lr5e-5-resta-0.1 is a 13 billion parameter language model derived from the Llama-2-13b-chat-hf family. It was constructed by wvnvwn using the Mergekit tool, employing a linear merge method to combine several pre-trained models.

Merge Details

This model is a composite of three distinct Llama-2-13b-chat-hf variants:

  • wvnvwn/llama-2-13b-chat-hf-lr5e-5-gsm8k-lr5e-5: Likely contributes to improved mathematical reasoning or problem-solving capabilities, given its name's reference to GSM8K.
  • wvnvwn/llama-2-13b-chat-hf-SSFT-lr5e-5: Suggests inclusion of a model fine-tuned with a specific Supervised Fine-Tuning (SSFT) approach.
  • meta-llama/Llama-2-13b-chat-hf: The foundational Meta Llama 2 13B chat model, providing general language understanding and generation.

The merge configuration utilized a weighted linear combination, with specific layer ranges and weights applied to each source model. This approach aims to blend the specialized strengths of the fine-tuned models with the robust base capabilities of Llama-2-13b-chat-hf.

Potential Use Cases

Given its merged components, this model could be particularly suitable for applications requiring:

  • General conversational AI: Leveraging the base Llama-2-13b-chat-hf's capabilities.
  • Reasoning tasks: Potentially enhanced by the GSM8K-tuned component.
  • Specific instruction-following: Benefiting from the SSFT-tuned model.