Khetterman/DarkAtom-12B-v3

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Nov 13, 2024Architecture:Transformer0.0K Warm

Khetterman/DarkAtom-12B-v3 is a 12 billion parameter language model created by Khetterman, featuring a 32768-token context length. This model is a complex merge of 18 distinct base models, utilizing a multi-step merging process including Slerp, ModelStock, and Ties methods. It is designed to synthesize diverse capabilities from its constituent models, offering a broad range of potential applications.

Loading preview...

DarkAtom-12B-v3 Overview

DarkAtom-12B-v3 is a 12 billion parameter language model developed by Khetterman, distinguished by its unique construction as a merge of 18 different base models. This intricate merging process, orchestrated using mergekit, involved a multi-step approach combining Slerp, ModelStock, and Ties methods, along with re-merging variations to achieve its final form. The model integrates diverse characteristics from its numerous components, aiming to leverage their collective strengths.

Key Capabilities

  • Synthesized Intelligence: Combines the strengths of 18 distinct models, potentially offering a broad and versatile range of capabilities across various domains.
  • Advanced Merging Techniques: Utilizes a sophisticated multi-step merging strategy (Slerp, ModelStock, Ties) to integrate model weights effectively.
  • Extensive Context Window: Supports a context length of 32768 tokens, enabling processing of longer inputs and maintaining conversational coherence over extended interactions.

Good For

  • Exploratory AI Research: Ideal for researchers and developers interested in the outcomes of complex model merging and the emergent properties of such combinations.
  • Diverse Task Handling: Its merged nature suggests potential for handling a wide array of tasks, from creative generation to more analytical applications, depending on the strengths of its constituent models.
  • Experimentation with Merged Architectures: Provides a robust platform for experimenting with and evaluating the performance of models created through advanced merging techniques.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p