nikhilkumar42/model_sft_resta is a 1.5 billion parameter language model merged using the Task Arithmetic method, with nikhilkumar42/model_harmful_full as its base and Qwen/Qwen2.5-1.5B-Instruct as the incorporated model. The merge aims to combine the strengths of its constituent models for specialized applications, and the model supports a 32768 token context length for processing long inputs.
Model_sft_resta Overview
nikhilkumar42/model_sft_resta is a 1.5 billion parameter language model created by nikhilkumar42 through a merge of pre-trained models. It utilizes the Task Arithmetic merge method, building upon nikhilkumar42/model_harmful_full as its base model. The merge also incorporates Qwen/Qwen2.5-1.5B-Instruct, combining their capabilities.
Key Characteristics
- Architecture: A merged model combining nikhilkumar42/model_harmful_full and Qwen/Qwen2.5-1.5B-Instruct.
- Parameter Count: 1.5 billion parameters, offering a balance between performance and computational efficiency.
- Context Length: Supports a substantial context window of 32768 tokens, enabling the processing of longer inputs and maintaining conversational coherence over extended interactions.
- Merge Method: Employs the Task Arithmetic method, which computes "task vectors" (the element-wise difference between a fine-tuned model's weights and the base model's weights) and adds them, optionally scaled, back onto the base, combining learned behavior from different models; see the sketch after this list.
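For readers unfamiliar with Task Arithmetic, the core operation can be sketched in a few lines of PyTorch. This is an illustrative sketch only, not the actual recipe used to produce model_sft_resta: the scaling factor `weight` is a hypothetical example, and merges like this are typically produced with dedicated tooling rather than raw state-dict arithmetic.

```python
# Illustrative sketch of Task Arithmetic merging (not the actual recipe used
# for model_sft_resta; the scaling factor below is a hypothetical example).
import torch
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained(
    "nikhilkumar42/model_harmful_full", torch_dtype=torch.float32
)
donor = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2.5-1.5B-Instruct", torch_dtype=torch.float32
)

weight = 1.0  # hypothetical scaling factor; the real merge weight is not published here

donor_state = donor.state_dict()
merged_state = {}
for name, base_param in base.state_dict().items():
    # Task vector: how the donor's weights differ from the base model's.
    task_vector = donor_state[name] - base_param
    # Task Arithmetic: add the (scaled) task vector back onto the base weights.
    merged_state[name] = base_param + weight * task_vector

base.load_state_dict(merged_state)
base.save_pretrained("model_sft_resta-sketch")
```

Because both models share the Qwen2.5-1.5B architecture, their state dicts have matching parameter names and shapes, which is what makes this element-wise arithmetic well defined.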
Intended Use Cases
This model is particularly suited for applications that benefit from the combined strengths of its merged components. Developers looking for a model with a specific fine-tuning focus, derived from the nikhilkumar42/model_harmful_full base and enhanced by Qwen/Qwen2.5-1.5B-Instruct, should consider model_sft_resta. Its large context window makes it suitable for tasks requiring extensive input understanding or generation.
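As a sketch of how one might load and query the model, standard transformers loading with Qwen2.5-style chat formatting should apply. This assumes the repository ships the usual config and tokenizer files and that the merge preserved the instruct chat template from Qwen/Qwen2.5-1.5B-Instruct; it is not an official usage snippet.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "nikhilkumar42/model_sft_resta"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Qwen2.5-style chat formatting; assumes the merged model kept the
# chat template from Qwen/Qwen2.5-1.5B-Instruct.
messages = [{"role": "user", "content": "Summarize the Task Arithmetic merge method."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```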