Chaotically/model_sft_resta
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Apr 3, 2026Architecture:Transformer Cold

Chaotically/model_sft_resta is a 1.5 billion parameter language model created by Chaotically using the Task Arithmetic merge method. This model combines 'sft_model_temp' and 'harmful_model_temp' with 'base_safe_temp' as its base. It is designed to leverage the strengths of its constituent models, offering a unique blend of capabilities for various language tasks.

Loading preview...

Model Overview

Chaotically/model_sft_resta is a 1.5 billion parameter language model developed by Chaotically. It was created using the Task Arithmetic merge method, which combines the weights of multiple pre-trained models to achieve specific characteristics.

Merge Details

This model is a composite, built upon base_safe_temp as its foundational model. It integrates two distinct models:

  • sft_model_temp: Incorporated with a weight of 1.0.
  • harmful_model_temp: Incorporated with a weight of -1.0.

This specific weighting suggests an intentional design to either enhance or mitigate certain behaviors or capabilities derived from the sft_model_temp and harmful_model_temp components, relative to the base_safe_temp.

Key Characteristics

  • Architecture: Merged model based on an unspecified base, likely a transformer architecture given its LLM nature.
  • Parameter Count: 1.5 billion parameters, making it suitable for applications requiring a balance between performance and computational efficiency.
  • Context Length: Supports a context window of 32768 tokens, allowing for processing and generating longer sequences of text.

Potential Use Cases

Given its unique merge configuration, model_sft_resta could be explored for applications where fine-grained control over model behavior, potentially related to safety or specific content generation, is desired. Developers might find it useful for tasks requiring a model with characteristics derived from the specific combination of its merged components.