Azazelle/Tippy-Toppy-7b

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 3, 2024License:cc-by-4.0Architecture:Transformer0.0K Open Weights Cold

Tippy-Toppy-7b by Azazelle is a 7 billion parameter language model created using a DARE merge method, building upon the Toppy-M-7b model. It integrates components from Mistral-7B-v0.1, Undi95/Toppy-M-7B, PistachioAlt/Noromaid-Bagel-7B-Slerp, and OpenPipe/mistral-ft-optimized-1227. This model is designed for general language tasks, leveraging its merged architecture to combine strengths from its constituent models, and supports a 4096-token context length.

Loading preview...

Overview

Azazelle/Tippy-Toppy-7b is a 7 billion parameter language model developed through a DARE (Disentangled Representation Editing) merge. It is specifically built on the foundation of Toppy-M-7b, integrating several other models to enhance its capabilities. The merge process combines mistralai/Mistral-7B-v0.1 as the base model with weighted contributions from Undi95/Toppy-M-7B, PistachioAlt/Noromaid-Bagel-7B-Slerp, and OpenPipe/mistral-ft-optimized-1227.

Key Capabilities

  • Merged Architecture: Leverages the strengths of multiple Mistral-based models through a DARE merge, potentially offering a balanced performance across various tasks.
  • Parameter Efficiency: At 7 billion parameters, it aims to provide robust language understanding and generation while maintaining a relatively efficient footprint.
  • bfloat16 Support: Utilizes bfloat16 for improved numerical stability and potentially faster inference on compatible hardware.

Good for

  • General-purpose language tasks: Suitable for a wide range of applications requiring text generation, summarization, and question answering.
  • Experimentation with merged models: Ideal for developers interested in exploring the performance characteristics of models created via DARE merge techniques.
  • Applications requiring a 7B model: Offers an alternative to other 7B models, potentially providing unique characteristics due to its specific merge composition.