mergekit-community/MS-RP-whole


mergekit-community/MS-RP-whole is a 24 billion parameter language model created with the Model Stock merge method, using ReadyArt/Forgotten-Safeword-24B-V2.2 as its base. The merge combines mergekit-community/MS3-RP-half1 and mergekit-community/MS3-RP-RP-half2 into a single set of weights intended for general text generation. With a 32768 token context length, it suits applications that require extensive contextual understanding.


Model Overview

mergekit-community/MS-RP-whole is a 24 billion parameter language model developed by mergekit-community. It was constructed using the Model Stock merge method, a technique introduced in the paper "Model Stock: All we need is just a few fine-tuned models" (arXiv:2403.19522).
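
At a high level, Model Stock averages a small number of fine-tuned checkpoints and then interpolates that average back toward the shared pre-trained weights, with an interpolation ratio derived from the angle between the fine-tuned weight deltas. Below is a minimal per-layer sketch of the two-model case, assuming the ratio t = 2·cos θ / (1 + cos θ) from the paper; the function name and tensor handling are illustrative, not mergekit's actual implementation:

```python
import torch

def model_stock_layer(w0: torch.Tensor, w1: torch.Tensor, w2: torch.Tensor) -> torch.Tensor:
    """Merge one layer from two fine-tuned models (w1, w2) anchored to the
    pre-trained weights w0 -- a sketch of the two-model Model Stock formula,
    not mergekit's actual code."""
    d1, d2 = (w1 - w0).flatten(), (w2 - w0).flatten()
    # Angle between the two fine-tuned weight deltas.
    cos_theta = torch.dot(d1, d2) / (d1.norm() * d2.norm() + 1e-12)
    # Interpolation ratio from the paper: t = 2*cos(theta) / (1 + cos(theta)).
    t = 2.0 * cos_theta / (1.0 + cos_theta)
    # Average the fine-tuned models, then pull part-way back toward the base.
    w_avg = (w1 + w2) / 2.0
    return t * w_avg + (1.0 - t) * w0
```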

Merge Details

This model's foundation is the ReadyArt/Forgotten-Safeword-24B-V2.2 base model. It incorporates components from two distinct models:

  • mergekit-community/MS3-RP-half1
  • mergekit-community/MS3-RP-RP-half2

The merge was performed in bfloat16, as specified in the merge configuration, with the aim of combining the strengths of the constituent models into a single, more capable model. A reconstruction of the configuration is sketched below.
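
The details above imply a mergekit configuration along the following lines. This is a reconstruction from this card, not the exact YAML shipped with the model; the snippet writes the config to a file and notes how it would be run with mergekit's `mergekit-yaml` CLI:

```python
import yaml  # pip install pyyaml

# Reconstructed merge configuration (model names taken from this card;
# the original YAML may include additional options).
merge_config = {
    "merge_method": "model_stock",
    "base_model": "ReadyArt/Forgotten-Safeword-24B-V2.2",
    "models": [
        {"model": "mergekit-community/MS3-RP-half1"},
        {"model": "mergekit-community/MS3-RP-RP-half2"},
    ],
    "dtype": "bfloat16",
}

with open("ms-rp-whole.yaml", "w") as f:
    yaml.safe_dump(merge_config, f, sort_keys=False)

# The merge itself would then be produced with, e.g.:
#   mergekit-yaml ms-rp-whole.yaml ./MS-RP-whole
```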

Key Characteristics

  • Parameter Count: 24 billion parameters.
  • Context Length: Supports a substantial context window of 32768 tokens.
  • Merge Method: Employs the Model Stock method to combine the fine-tuned models above with their shared base, as illustrated in the sketch earlier in this card.
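
The merged model loads like any other Hugging Face causal language model. A minimal usage sketch with the transformers library, assuming enough GPU memory for a 24B model in bfloat16 (quantized loading would lower the requirement):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mergekit-community/MS-RP-whole"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the dtype used for the merge
    device_map="auto",           # shard across available devices
)

prompt = "Write the opening scene of a mystery novel set in 1920s Vienna."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.8)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```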

Potential Use Cases

Given its substantial parameter count and large context window, MS-RP-whole is well-suited for applications requiring:

  • Advanced Language Generation: Creating coherent and contextually relevant text over long passages.
  • Complex Reasoning: Handling tasks that benefit from a broad understanding of input context.
  • General-Purpose LLM Applications: Serving as a backbone for natural language processing tasks where the combined capabilities of the merged constituents are an advantage.