automerger/YamShadow-7B

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Mar 10, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

YamShadow-7B is a 7 billion parameter language model, an automated merge created by Maxime Labonne. This model was generated using a DARE TIES merge method, combining 'mayacinka/yam-jom-7B' and 'CorticalStack/shadow-clown-7B-slerp' with specific density and weight parameters. It is designed for general text generation tasks, leveraging the combined strengths of its constituent models.

Loading preview...

YamShadow-7B: An Automated Merge Model

YamShadow-7B is a 7 billion parameter language model, developed by Maxime Labonne through an automated merging process. This model is a product of the DARE TIES merge method, specifically combining two distinct base models: 'mayacinka/yam-jom-7B' and 'CorticalStack/shadow-clown-7B-slerp'. The merge configuration involved a density of 0.53 and a weight of 0.6 for the 'shadow-clown-7B-slerp' component, with 'mayacinka/yam-jom-7B' serving as the primary base.

Key Characteristics

  • Automated Merge: Created using a structured, automated merging technique rather than traditional fine-tuning.
  • DARE TIES Method: Utilizes the DARE TIES merging algorithm, which is known for its ability to combine model strengths effectively.
  • 7 Billion Parameters: A moderately sized model, offering a balance between performance and computational efficiency.
  • Bfloat16 Precision: Configured to use bfloat16 data type, which can improve training and inference speed while maintaining reasonable accuracy.

Good For

  • General Text Generation: Suitable for a wide range of language understanding and generation tasks.
  • Experimentation with Merged Models: Provides a practical example of a model created via automated merging, useful for researchers and developers exploring model combination techniques.
  • Applications requiring a 7B model: Can be integrated into applications where a 7 billion parameter model fits the performance and resource requirements.