wvnvwn/llama-2-13b-chat-hf-lr5e-5-resta-0.1
The wvnvwn/llama-2-13b-chat-hf-lr5e-5-resta-0.1 model is a 13 billion parameter language model based on the Llama-2-13b-chat-hf architecture, created by wvnvwn through a linear merge using Mergekit. This model integrates components from specialized Llama-2 variants, including one fine-tuned for GSM8K and another with SSFT, alongside the base Meta Llama-2-13b-chat-hf. It is designed to leverage the combined strengths of its constituent models, potentially offering enhanced performance in areas like mathematical reasoning and general chat capabilities within its 4096-token context window.
Model Overview
The wvnvwn/llama-2-13b-chat-hf-lr5e-5-resta-0.1 is a 13 billion parameter language model derived from the Llama-2-13b-chat-hf family. It was constructed by wvnvwn using the Mergekit tool, employing a linear merge method to combine several pre-trained models.
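Conceptually, a linear merge produces each merged parameter as a weighted sum of the corresponding parameters from the source models. A minimal numeric sketch of that operation (the weights here are hypothetical placeholders, not the values used in the actual merge):

```python
# Minimal illustration of a linear model merge: every merged parameter is a
# weighted sum of the aligned parameters from the source models.
# The weights below are hypothetical, not taken from the real merge config.

def linear_merge(param_sets, weights):
    """Merge aligned parameter vectors by weighted sum."""
    assert len(param_sets) == len(weights)
    n = len(param_sets[0])
    return [
        sum(w * params[i] for params, w in zip(param_sets, weights))
        for i in range(n)
    ]

# Toy "parameters" standing in for corresponding tensors of the three models.
gsm8k_params = [1.0, 2.0, 3.0]
ssft_params = [2.0, 0.0, 1.0]
base_params = [0.0, 1.0, 1.0]

merged = linear_merge(
    [gsm8k_params, ssft_params, base_params],
    [0.3, 0.3, 0.4],  # hypothetical per-model weights summing to 1.0
)
print(merged)
```

In a real merge, the same weighted sum is applied tensor-by-tensor across every layer selected in the Mergekit configuration.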
Merge Details
This model is a composite of three distinct Llama-2-13b-chat-hf variants:
- wvnvwn/llama-2-13b-chat-hf-lr5e-5-gsm8k-lr5e-5: Likely contributes improved mathematical reasoning or problem-solving capabilities, given its name's reference to GSM8K.
- wvnvwn/llama-2-13b-chat-hf-SSFT-lr5e-5: A supervised fine-tuned variant; the exact meaning of "SSFT" is not documented.
- meta-llama/Llama-2-13b-chat-hf: The foundational Meta Llama 2 13B chat model, providing general language understanding and generation.
The merge configuration utilized a weighted linear combination, with specific layer ranges and weights applied to each source model. This approach aims to blend the specialized strengths of the fine-tuned models with the robust base capabilities of Llama-2-13b-chat-hf.
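As a rough illustration, a Mergekit linear merge over these three models would be described by a YAML configuration along these lines (the layer ranges and weights shown are hypothetical placeholders, not the values actually used for this model):

```yaml
# Hypothetical Mergekit config sketch for a linear merge of the three sources.
merge_method: linear
slices:
  - sources:
      - model: wvnvwn/llama-2-13b-chat-hf-lr5e-5-gsm8k-lr5e-5
        layer_range: [0, 40]
        parameters:
          weight: 0.3   # hypothetical weight
      - model: wvnvwn/llama-2-13b-chat-hf-SSFT-lr5e-5
        layer_range: [0, 40]
        parameters:
          weight: 0.3   # hypothetical weight
      - model: meta-llama/Llama-2-13b-chat-hf
        layer_range: [0, 40]
        parameters:
          weight: 0.4   # hypothetical weight
dtype: float16
```

Llama-2-13b has 40 transformer layers, so `[0, 40]` covers the full model; narrower ranges can be used to merge only selected layers.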
Potential Use Cases
Given its merged components, this model could be particularly suitable for applications requiring:
- General conversational AI: Leveraging the base Llama-2-13b-chat-hf's capabilities.
- Reasoning tasks: Potentially enhanced by the GSM8K-tuned component.
- Specific instruction-following: Benefiting from the SSFT-tuned model.