ank028/Llama-3.2-1B-Instruct-commonsense_qa-MGSM8K-sft1-linear
Text Generation · Concurrency Cost: 1 · Model Size: 1B · Quant: BF16 · Ctx Length: 32k · Architecture: Transformer

The ank028/Llama-3.2-1B-Instruct-commonsense_qa-MGSM8K-sft1-linear model is a 1-billion-parameter instruction-tuned language model, created by ank028 and based on the Llama 3.2 architecture. It was produced by merging two specialized Llama-3.2-1B-Instruct variants: one fine-tuned for commonsense question answering and another for mathematical reasoning (MGSM8K). The result is optimized for tasks that require both general commonsense understanding and elementary mathematical problem-solving, making it suitable for applications that need a blend of these abilities.


Model Overview

This model, ank028/Llama-3.2-1B-Instruct-commonsense_qa-MGSM8K-sft1-linear, is a 1 billion parameter instruction-tuned language model. It was created by ank028 using the mergekit tool, specifically employing the linear merge method.
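A linear merge in mergekit is driven by a small YAML configuration. The author's exact config is not published here, but given the stated method and source models, it would look along these lines (field names follow mergekit's documented schema; the `dtype` value is an assumption based on the BF16 quantization listed above):

```yaml
# Sketch of a mergekit linear-merge config for this model (illustrative).
models:
  - model: ank028/Llama-3.2-1B-Instruct-commonsense_qa
    parameters:
      weight: 0.5
  - model: autoprogrammer/Llama-3.2-1B-Instruct-MGSM8K-sft1
    parameters:
      weight: 0.5
merge_method: linear
dtype: bfloat16
```

Running `mergekit-yaml` on such a config writes the merged checkpoint to an output directory, ready to load with standard Hugging Face tooling.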

Key Capabilities

The model's capabilities are derived from its constituent components:

  • Commonsense Question Answering: Inherits strengths from ank028/Llama-3.2-1B-Instruct-commonsense_qa, making it proficient in understanding and responding to queries requiring general world knowledge and practical reasoning.
  • Mathematical Reasoning: Benefits from autoprogrammer/Llama-3.2-1B-Instruct-MGSM8K-sft1, which was fine-tuned on the MGSM8K dataset, indicating an aptitude for elementary mathematical problem-solving.

Merge Details

The model was constructed by merging two fine-tuned Llama-3.2-1B-Instruct variants with equal weighting (0.5 each) using a linear merge strategy. This approach aims to combine the distinct specializations of the source models into a single, more versatile model.
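Conceptually, a linear merge is just an element-wise weighted average of corresponding parameters across the source checkpoints. A minimal sketch with toy weight values (plain Python lists standing in for real tensors; not the actual mergekit implementation):

```python
def linear_merge(state_dicts, weights):
    """Element-wise weighted average of parameter values.

    state_dicts: list of dicts mapping parameter name -> list of floats
    weights: one scalar weight per source model (here 0.5 each)
    """
    merged = {}
    for name in state_dicts[0]:
        params = [sd[name] for sd in state_dicts]
        merged[name] = [
            sum(w * p[i] for w, p in zip(weights, params))
            for i in range(len(params[0]))
        ]
    return merged

# Toy stand-ins for the two fine-tuned checkpoints.
qa_model = {"layer.weight": [1.0, 2.0, 3.0]}
math_model = {"layer.weight": [3.0, 4.0, 5.0]}

merged = linear_merge([qa_model, math_model], weights=[0.5, 0.5])
print(merged["layer.weight"])  # [2.0, 3.0, 4.0]
```

With equal 0.5 weights this reduces to a simple mean of the two checkpoints, which is why the merged model inherits behavior from both sources rather than replacing one with the other.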

Good For

This model is particularly well-suited for applications that require a combination of:

  • General-purpose instruction following.
  • Commonsense reasoning tasks.
  • Basic mathematical problem-solving.

Its compact 1-billion-parameter size makes it efficient to deploy in resource-constrained environments while still offering specialized capabilities in its target domains.