Retreatcost/Chrysologus-12B

Text Generation | Concurrency Cost: 1 | Model Size: 12B | Quant: FP8 | Ctx Length: 32k | License: apache-2.0 | Architecture: Transformer | Open Weights | Cold

Retreatcost/Chrysologus-12B is a 12-billion-parameter language model created by Retreatcost for storytelling with improved instruction following. It is a merge of yamatazen/EtherealAurora-12B, Sicarius-Prototyping/Impish_Longtail_12B, and allura-org/MN-Lyrebird-12B, produced with the Karcher Mean merge method. It follows instructions more closely than earlier models such as Retreatcost/Impish-LongPen-12B, which makes it suitable for narrative generation tasks. The model supports a context length of 32768 tokens.

Overview

Retreatcost/Chrysologus-12B is a 12-billion-parameter language model developed for storytelling with decent instruction following. It improves on prior models, notably following instructions better than Retreatcost/Impish-LongPen-12B.

Key Characteristics

  • Architecture: A merged model, combining three distinct pre-trained language models.
  • Merge Method: Karcher Mean, which averages the component weights as a Riemannian (Fréchet) mean, that is, the point minimizing the sum of squared geodesic distances to the inputs, rather than as a plain linear average.
  • Component Models: Built from a merge of:
    • yamatazen/EtherealAurora-12B
    • Sicarius-Prototyping/Impish_Longtail_12B
    • allura-org/MN-Lyrebird-12B
  • Context Length: Supports a context window of 32768 tokens (32k), useful for longer narratives and complex instructions.
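
To illustrate the Karcher Mean named in the list above, here is a minimal sketch. It is not the merge tooling's actual implementation. It assumes, purely for illustration, that each input is a unit vector on a sphere and that all inputs are weighted equally; the function name karcher_mean_sphere and its parameters are hypothetical.

```python
import math


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _normalize(v):
    n = math.sqrt(_dot(v, v))
    return [x / n for x in v]


def karcher_mean_sphere(points, max_iter=100, tol=1e-12):
    """Equal-weight Karcher mean of vectors projected onto the unit sphere.

    Illustrative only: a fixed-point iteration that averages the inputs in
    the tangent space at the current estimate, then maps the result back
    onto the sphere.
    """
    pts = [_normalize(p) for p in points]
    dim = len(pts[0])
    count = len(pts)
    # Start from the normalized arithmetic mean of the inputs.
    mu = _normalize([sum(p[i] for p in pts) / count for i in range(dim)])
    for _ in range(max_iter):
        tangent = [0.0] * dim
        for p in pts:
            c = _dot(mu, p)
            # Part of p orthogonal to mu gives the geodesic direction.
            ortho = [p[i] - c * mu[i] for i in range(dim)]
            norm = math.sqrt(_dot(ortho, ortho))
            if norm > 1e-15:
                theta = math.atan2(norm, c)  # geodesic distance from mu to p
                for i in range(dim):
                    tangent[i] += theta * ortho[i] / (norm * count)
        step = math.sqrt(_dot(tangent, tangent))
        if step < tol:
            break  # mean tangent vector vanished: mu is the Karcher mean
        # Move along the geodesic in the direction of the mean tangent vector.
        mu = _normalize([
            math.cos(step) * mu[i] + math.sin(step) * tangent[i] / step
            for i in range(dim)
        ])
    return mu
```

For three points on a circle at angles 0, 0.2, and 1.0 rad, the result sits at the mean angle of 0.4 rad, whereas normalizing the plain arithmetic mean of the vectors lands at about 0.39 rad. That gap is the difference between averaging along the curved space and averaging linearly.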

Use Cases

This model is particularly well-suited for applications requiring:

  • Narrative Generation: Creating coherent and engaging stories.
  • Instruction-Following: Following prompts and instructions more closely than earlier models such as Retreatcost/Impish-LongPen-12B, especially in creative writing contexts.
  • Content Creation: Generating various forms of text where both creativity and adherence to specific guidelines are important.
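
For the narrative generation use case, a rough usage sketch follows. It assumes the weights can be loaded through the Hugging Face transformers library under the same repository id, which this page does not confirm. The helper names, the sampling temperature, and the default output length are illustrative placeholders, and no prompt or chat template is applied because none is specified here. The budget helper only enforces the stated 32768-token context window.

```python
CONTEXT_LENGTH = 32768  # context window stated for this model


def max_new_tokens_for(prompt_tokens, requested, context_length=CONTEXT_LENGTH):
    """Clamp the requested output length so prompt + output fits the window."""
    if prompt_tokens >= context_length:
        raise ValueError("prompt alone fills or exceeds the context window")
    return min(requested, context_length - prompt_tokens)


def generate_story(prompt, requested_new_tokens=512,
                   model_id="Retreatcost/Chrysologus-12B"):
    """Generate a continuation of `prompt` (assumes transformers + torch)."""
    # Imported lazily so the budget helper works without these packages.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # device_map="auto" requires the accelerate package to be installed.
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype="auto", device_map="auto"
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    prompt_len = inputs["input_ids"].shape[1]
    budget = max_new_tokens_for(prompt_len, requested_new_tokens)
    output = model.generate(
        **inputs,
        max_new_tokens=budget,
        do_sample=True,
        temperature=0.8,  # placeholder value, not a recommended setting
    )
    # Drop the prompt tokens and return only the newly generated text.
    return tokenizer.decode(output[0][prompt_len:], skip_special_tokens=True)
```

A call such as generate_story("Write the opening scene of a mystery set in a lighthouse.") would download the full 12B model on first use, so it is best tried on a machine with enough GPU memory.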