Model Overview
Chaotically/model_sft_resta is a 1.5 billion parameter language model developed by Chaotically. It was created using the Task Arithmetic merge method, which combines the weights of multiple pre-trained models to achieve specific characteristics.
Merge Details
This model is a composite, built upon base_safe_temp as its foundational model. It integrates two distinct models:
- sft_model_temp: incorporated with a weight of 1.0.
- harmful_model_temp: incorporated with a weight of -1.0.
The signs of these weights are significant: the positive weight adds the task vector of sft_model_temp, while the -1.0 weight subtracts the task vector of harmful_model_temp. This suggests an intentional design to retain the capabilities gained through supervised fine-tuning while removing behaviors associated with the harmful component, relative to base_safe_temp.
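In Task Arithmetic, each contributing model is reduced to a "task vector" (its weights minus the base model's weights), and the merged model is the base plus a weighted sum of those vectors. The following sketch illustrates the idea on toy dictionaries of floats; the names and values are illustrative only, and a real merge would operate on full model state dicts (e.g. via a merge toolkit) rather than scalars.

```python
# Illustrative Task Arithmetic merge on toy weight dictionaries.
# Assumption: models are represented as {param_name: value} dicts;
# real merges apply the same formula tensor-by-tensor.
def task_arithmetic_merge(base, experts):
    """experts: list of (weights, coefficient) pairs merged into base."""
    merged = dict(base)
    for weights, coeff in experts:
        for name, value in weights.items():
            # task vector = expert weights minus base weights
            merged[name] += coeff * (value - base[name])
    return merged

# Hypothetical single-parameter models standing in for the real checkpoints.
base = {"w": 0.5}     # base_safe_temp
sft = {"w": 0.9}      # sft_model_temp
harmful = {"w": 0.7}  # harmful_model_temp

merged = task_arithmetic_merge(base, [(sft, 1.0), (harmful, -1.0)])
# merged["w"] = 0.5 + 1.0*(0.9 - 0.5) + (-1.0)*(0.7 - 0.5) = 0.7
```

The -1.0 coefficient makes the harmful task vector a subtraction: the merge moves the base away from the harmful model's direction while keeping the supervised fine-tuning direction.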
Key Characteristics
- Architecture: Merged model sharing the architecture of its base, base_safe_temp; as a language model of this size, it is presumably a decoder-only transformer.
- Parameter Count: 1.5 billion parameters, making it suitable for applications requiring a balance between performance and computational efficiency.
- Context Length: Supports a context window of 32768 tokens, allowing for processing and generating longer sequences of text.
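To make the "balance between performance and computational efficiency" claim concrete, a rough back-of-envelope calculation (illustrative, not from the model card) shows the memory needed just to hold 1.5 billion parameters in 16-bit precision:

```python
# Back-of-envelope memory estimate for the weights of a 1.5B-parameter model.
# Assumption: every parameter stored in fp16/bf16 (2 bytes each);
# activations, KV cache, and any optimizer state are extra.
PARAMS = 1_500_000_000
BYTES_PER_PARAM = 2  # fp16 / bf16

weight_bytes = PARAMS * BYTES_PER_PARAM
weight_gib = weight_bytes / 1024**3
print(f"~{weight_gib:.2f} GiB for weights alone")
```

At roughly 2.8 GiB for weights, the model fits comfortably on consumer GPUs, though long 32768-token contexts will add significant KV-cache memory on top.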
Potential Use Cases
Given its merge configuration, model_sft_resta could be explored for applications where fine-grained control over model behavior is desired, particularly safety-oriented adjustment: subtracting a harmful task vector is a common approach to removing unwanted behaviors while preserving fine-tuned capabilities. Developers might find it useful for tasks that benefit from this specific combination of merged components.