jaspionjader/Kosmos-EVAA-immersive-mix-v45.1-8B

Hugging Face
Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 32k | Published: Feb 5, 2025 | Architecture: Transformer

jaspionjader/Kosmos-EVAA-immersive-mix-v45.1-8B is an 8-billion-parameter language model created by jaspionjader, merged from jaspionjader/bh-58 and suayptalha/Maestro-R1-Llama-8B using the SLERP method. It has a context length of 32,768 tokens and is intended as a general-purpose model for text generation and understanding tasks.


Model Overview

jaspionjader/Kosmos-EVAA-immersive-mix-v45.1-8B is an 8-billion-parameter language model developed by jaspionjader. It was created by merging two pretrained models, jaspionjader/bh-58 and suayptalha/Maestro-R1-Llama-8B, using the SLERP (Spherical Linear Interpolation) merge method. SLERP interpolates between two weight tensors along the arc that connects them rather than along a straight line, with the aim of combining the strengths of both source models in a single, more versatile model.
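As an illustration of the interpolation itself, here is a minimal NumPy sketch of SLERP between two weight tensors. It is not the code used to build this model: the function name `slerp`, the tolerance values, and the fallback to linear interpolation for nearly parallel tensors are assumptions made for the example.

```python
import numpy as np

def slerp(t, w0, w1, eps=1e-8):
    """Spherical linear interpolation between two weight tensors.

    t = 0 returns w0, t = 1 returns w1. Illustrative sketch only.
    """
    w0 = np.asarray(w0, dtype=np.float64)
    w1 = np.asarray(w1, dtype=np.float64)

    # Measure the angle between the two tensors, treated as flat vectors.
    v0 = w0.ravel()
    v1 = w1.ravel()
    n0 = v0 / (np.linalg.norm(v0) + eps)
    n1 = v1 / (np.linalg.norm(v1) + eps)
    dot = float(np.clip(np.dot(n0, n1), -1.0, 1.0))

    # Nearly parallel or antiparallel: sin(omega) is close to zero,
    # so fall back to ordinary linear interpolation (an assumption here).
    if abs(dot) > 1.0 - 1e-6:
        return (1.0 - t) * w0 + t * w1

    omega = np.arccos(dot)
    sin_omega = np.sin(omega)
    c0 = np.sin((1.0 - t) * omega) / sin_omega
    c1 = np.sin(t * omega) / sin_omega
    return c0 * w0 + c1 * w1
```

For two orthogonal unit vectors, the midpoint `slerp(0.5, a, b)` stays on the unit circle, whereas a plain average would shrink the norm to about 0.71.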

Merge Details

The merge configuration covers layers 0 to 32 of both jaspionjader/bh-58 and suayptalha/Maestro-R1-Llama-8B, with jaspionjader/bh-58 as the base_model. Rather than using one global interpolation factor, the SLERP parameter t takes different values for the self_attn and mlp components and varies across layers, so attention and feed-forward weights are blended in different proportions at different depths. The merge was computed in the bfloat16 data type.
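To make the idea of a per-layer, per-component t concrete, the sketch below expands a short list of anchor values into one t per layer. Everything specific in it is an assumption: the anchor values in `HYPOTHETICAL_T` are placeholders and not the values used for this model, and the even spacing with linear interpolation is only one common way merge tools treat a list-valued t.

```python
import numpy as np

NUM_LAYERS = 32  # layer range 0 to 32, as stated in the merge details

def layer_t_values(anchors, num_layers=NUM_LAYERS):
    """Spread anchor values evenly over the layers and interpolate linearly."""
    anchors = np.asarray(anchors, dtype=np.float64)
    if anchors.size == 1:
        return np.full(num_layers, anchors[0])
    anchor_pos = np.linspace(0.0, 1.0, anchors.size)
    layer_pos = np.linspace(0.0, 1.0, num_layers)
    return np.interp(layer_pos, anchor_pos, anchors)

# Placeholder anchors for illustration only; NOT this model's real values.
HYPOTHETICAL_T = {
    "self_attn": [0.0, 0.5, 1.0],
    "mlp": [1.0, 0.5, 0.0],
}

# One t per layer for each component type.
schedule = {name: layer_t_values(a) for name, a in HYPOTHETICAL_T.items()}
```

Under these placeholder anchors, the attention weights of the first layer would come entirely from one source model and those of the last layer entirely from the other, with the mlp weights following the opposite trend.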

Potential Use Cases

Given its origin as a merge of general-purpose language models, Kosmos-EVAA-immersive-mix-v45.1-8B is likely suitable for a broad range of applications, including:

  • Text generation: Creating coherent and contextually relevant text.
  • Content creation: Assisting with writing tasks, summarization, and expansion.
  • Conversational AI: Developing chatbots or interactive agents.
  • General language understanding: Tasks requiring comprehension and inference from text.
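
For the use cases above, a minimal sketch of loading the model with the Hugging Face transformers library might look as follows. It assumes that torch, transformers, and accelerate are installed, that the repository is loadable through `AutoModelForCausalLM`, and that enough memory is available for an 8B model in bfloat16. The helper name `generate_text` and its defaults are hypothetical.

```python
MODEL_ID = "jaspionjader/Kosmos-EVAA-immersive-mix-v45.1-8B"

def generate_text(prompt, max_new_tokens=128):
    """Load the model and return a completion for `prompt` (illustrative)."""
    # Imported lazily so that defining this helper does not require the
    # heavy dependencies to be installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # matches the dtype named in the merge details
        device_map="auto",           # requires the accelerate package
    )

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens and decode only the newly generated part.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate_text("Summarize the plot of Hamlet in two sentences.")` would download the weights on first use. For conversational use, the prompt would normally be formatted with whatever chat template the tokenizer provides, which is not documented here.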