nothingiisreal/MN-12B-Starcannon-v3
Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Aug 13, 2024Architecture:Transformer0.0K Warm

MN-12B-Starcannon-v3 is a 12 billion parameter language model developed by nothingiisreal, created through a TIES merge of anthracite-org/magnum-12b-v2 and nothingiisreal/MN-12B-Celeste-V1.9. This model leverages a 32768 token context length and is designed as a general-purpose language model, inheriting capabilities from its merged components. It is suitable for various text generation and understanding tasks, building upon the strengths of its constituent models.

Loading preview...

MN-12B-Starcannon-v3 Overview

MN-12B-Starcannon-v3 is a 12 billion parameter language model developed by nothingiisreal. It was created using the TIES merge method from mergekit, combining two distinct pre-trained models to enhance its capabilities.

Merge Details

This model is a merge of:

  • nothingiisreal/MN-12B-Celeste-V1.9 (used as the base model)
  • anthracite-org/magnum-12b-v2

The TIES merge method was applied with specific density and weight parameters for each component, aiming to integrate their strengths. The configuration utilized bfloat16 dtype and included normalize: true and int8_mask: true parameters during the merge process.

Key Characteristics

  • Parameter Count: 12 billion parameters.
  • Context Length: Supports a context window of 32768 tokens.
  • Merge Method: Utilizes the TIES (Trimmed, Iterative, and Self-consistent) merging technique.

Availability

Various quantized versions are available for broader accessibility and deployment:

Intended Use Cases

As a merged model, MN-12B-Starcannon-v3 is designed for general-purpose language tasks, benefiting from the combined knowledge and capabilities of its constituent models. It is suitable for applications requiring robust text generation, comprehension, and conversational abilities.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p