jaspionjader/Kosmos-EVAA-Franken-Immersive-v40-8B

Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 32k | Published: Jan 14, 2025 | Architecture: Transformer

jaspionjader/Kosmos-EVAA-Franken-Immersive-v40-8B is a merged language model created by jaspionjader using the SLERP method. This model combines jaspionjader/Kosmos-EVAA-Franken-Immersive-v39-8B and jaspionjader/knf-1-8b, integrating their respective layer ranges. It is designed to leverage the strengths of its constituent models for general language generation tasks.


Model Overview

jaspionjader/Kosmos-EVAA-Franken-Immersive-v40-8B is a merged language model developed by jaspionjader. It was created using the SLERP (Spherical Linear Interpolation) merge method, which blends the weights of two pre-trained models by interpolating along an arc between them rather than along a straight line, so that the result draws on the learned representations of both.
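
As a rough illustration of what SLERP does to a pair of weight tensors, the sketch below interpolates between two flattened vectors in plain Python. It is a minimal sketch of the general technique, not the code used to build this model; the function name `slerp`, the `eps` threshold, and the fallback to linear interpolation for near-parallel vectors are assumptions made for the example.

```python
import math

def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between two flat weight vectors.

    t=0 returns v0 and t=1 returns v1. Falls back to plain linear
    interpolation when a vector is near zero or the two vectors are
    nearly (anti)parallel, where dividing by sin(angle) is unstable.
    """
    lerp = [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    norm0 = math.sqrt(sum(a * a for a in v0))
    norm1 = math.sqrt(sum(b * b for b in v1))
    if norm0 < eps or norm1 < eps:
        return lerp

    # Angle between the two vectors, from their normalized dot product.
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    dot = max(-1.0, min(1.0, dot))  # guard against rounding outside [-1, 1]
    omega = math.acos(dot)
    sin_omega = math.sin(omega)
    if abs(sin_omega) < eps:
        return lerp

    # Weights follow the arc between v0 and v1 instead of the chord.
    w0 = math.sin((1 - t) * omega) / sin_omega
    w1 = math.sin(t * omega) / sin_omega
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]
```

For two orthogonal unit vectors, `slerp(0.5, [1.0, 0.0], [0.0, 1.0])` lands on the unit circle halfway between them, whereas linear interpolation would give the shorter vector `[0.5, 0.5]`.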

Merge Details

This model is a composite of two source models:

  • jaspionjader/Kosmos-EVAA-Franken-Immersive-v39-8B: This model forms one of the primary components, contributing its learned features across all 32 layers.
  • jaspionjader/knf-1-8b: The second component, also contributing its full 32 layers, is integrated to enhance the overall model capabilities.

The merge process adjusted the self_attn and mlp parameters with varying interpolation values, so attention and feed-forward weights are not blended at a single global ratio. The base model for the merge was jaspionjader/Kosmos-EVAA-Franken-Immersive-v39-8B.
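
One way to picture per-component interpolation is a lookup that picks an interpolation factor t from the parameter name and layer index. The sketch below is only an illustration under stated assumptions: the schedule values, the default of 0.5, the parameter-name patterns, and the helper names `gradient_value` and `t_for` are all hypothetical, and the actual values used for this merge are not given here.

```python
NUM_LAYERS = 32  # both source models contribute 32 layers

# Hypothetical schedules: anchor values spread evenly across the layer stack.
# The real interpolation values for this merge are not stated in this card.
T_SCHEDULE = [
    ("self_attn", [0.0, 0.5, 0.3, 0.7, 1.0]),
    ("mlp", [1.0, 0.5, 0.7, 0.3, 0.0]),
]
DEFAULT_T = 0.5  # hypothetical fallback for every other tensor

def gradient_value(anchors, layer_idx, num_layers=NUM_LAYERS):
    """Linearly interpolate a list of anchor values across the layers."""
    if len(anchors) == 1 or num_layers == 1:
        return anchors[0]
    pos = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]

def t_for(param_name, layer_idx):
    """Return the interpolation factor for one parameter tensor."""
    for key, anchors in T_SCHEDULE:
        if key in param_name:
            return gradient_value(anchors, layer_idx)
    return DEFAULT_T
```

With this hypothetical schedule, t = 0 keeps the base model's weights and t = 1 takes the other model's, so early attention layers would stay close to the base model while early MLP layers would lean toward the second model.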

Intended Use

As a merged model, Kosmos-EVAA-Franken-Immersive-v40-8B is designed to inherit, and potentially improve upon, the general language understanding and generation capabilities of its constituent models. It is suited to applications that need a general-purpose text generation model.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Each configuration sets the following parameters:

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p
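
The sampler parameters named above could be bundled into a generation request along the following lines. This is a sketch under assumptions: the payload shape (a flat dictionary with `model` and `prompt` keys) and the helper name `build_request` are hypothetical, and the numeric values in the usage note are placeholders, not the popular settings themselves.

```python
MODEL_ID = "jaspionjader/Kosmos-EVAA-Franken-Immersive-v40-8B"

# Parameter names taken from the list above.
SAMPLER_KEYS = (
    "temperature",
    "top_p",
    "top_k",
    "frequency_penalty",
    "presence_penalty",
    "repetition_penalty",
    "min_p",
)

def build_request(prompt, **sampler):
    """Build a request payload, rejecting unknown sampler names.

    The flat payload layout is an assumption for illustration; check the
    serving API's documentation for the exact schema it expects.
    """
    unknown = sorted(set(sampler) - set(SAMPLER_KEYS))
    if unknown:
        raise ValueError(f"unknown sampler parameters: {unknown}")
    return {"model": MODEL_ID, "prompt": prompt, **sampler}
```

For example, `build_request("Once upon a time", temperature=0.8, min_p=0.05)` returns a dictionary carrying only the two settings supplied, leaving the rest to the server's defaults.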