itsmepv/model_sft_resta
Text Generation · Concurrency Cost: 1 · Model Size: 1.5B · Quant: BF16 · Ctx Length: 32K · Published: Apr 1, 2026 · Architecture: Transformer · Cold

itsmepv/model_sft_resta is a 1.5 billion parameter language model created by itsmepv, built by merging pre-trained models with the Task Arithmetic method. It uses Qwen/Qwen2.5-1.5B-Instruct as its base model, adding the 'fused_sft_full' component and subtracting the 'fused_harmful_full' component, and supports a 32K token context length.


Model Overview

itsmepv/model_sft_resta is a 1.5 billion parameter language model developed by itsmepv. It was constructed using the MergeKit tool, specifically employing the Task Arithmetic merge method. The base model for this merge is Qwen/Qwen2.5-1.5B-Instruct, which provides a robust foundation with a 32,768 token context length.

Merge Details

This model integrates two distinct components: ./fused_sft_full and ./fused_harmful_full. The Task Arithmetic method was applied with a weight of 1.0 for fused_sft_full and a weight of -1.0 for fused_harmful_full. In other words, the fused_sft_full task vector is added to the base model while the fused_harmful_full task vector is subtracted, a configuration that suggests a deliberate attempt to retain the supervised fine-tuning behavior while removing the behavior encoded in the harmful component. The bfloat16 dtype indicates an optimization for efficiency and performance during inference.
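The actual MergeKit configuration file is not shown on this page, but based on the method, weights, paths, and dtype described above, it likely resembled the following sketch (field values are inferred from the text, not copied from the repository):

```yaml
# Hypothetical reconstruction of the merge config described above.
merge_method: task_arithmetic
base_model: Qwen/Qwen2.5-1.5B-Instruct
models:
  - model: ./fused_sft_full
    parameters:
      weight: 1.0
  - model: ./fused_harmful_full
    parameters:
      weight: -1.0
dtype: bfloat16
```

With a config like this, MergeKit computes each component's task vector (fine-tuned weights minus base weights), scales it by the given weight, and adds the result back onto the base model.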

Potential Use Cases

Given its merging strategy, model_sft_resta is tailored to applications that benefit from the behavior added by 'fused_sft_full' and the behavior removed by subtracting 'fused_harmful_full'. Developers should weigh this specialized construction when evaluating the model's suitability for general-purpose tasks.
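The effect of the +1.0/-1.0 weights can be illustrated with a toy sketch of Task Arithmetic. The arrays below are invented stand-ins for full parameter tensors; only the arithmetic mirrors the merge described above:

```python
import numpy as np

# Toy stand-ins for model parameter vectors (values are illustrative only).
base = np.array([0.10, 0.20, 0.30])           # Qwen2.5-1.5B-Instruct
fused_sft = np.array([0.15, 0.25, 0.28])      # ./fused_sft_full
fused_harmful = np.array([0.12, 0.18, 0.33])  # ./fused_harmful_full

# Task vectors: each fine-tuned model minus the shared base.
tau_sft = fused_sft - base
tau_harmful = fused_harmful - base

# Task Arithmetic merge with weights 1.0 and -1.0:
# the SFT delta is added, the harmful delta is subtracted.
merged = base + 1.0 * tau_sft - 1.0 * tau_harmful
print(merged)
```

Where the two task vectors point in the same direction, the subtraction cancels part of the SFT gain; where they oppose each other, the merged parameters move further than either fine-tune alone.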