allknowingroger/Gemma2Slerp1-2.6B

Text Generation · Concurrency Cost: 1 · Model Size: 2.6B · Quantization: BF16 · Context Length: 8k · Published: Dec 4, 2024 · Architecture: Transformer

allknowingroger/Gemma2Slerp1-2.6B is a 2.6 billion parameter language model created by allknowingroger using the SLERP merge method. It combines Lil-R/2_PRYMMAL-ECE-2B-SLERP-V2 and zake7749/gemma-2-2b-it-chinese-kyara-dpo, aiming to integrate the strengths of both base models into a single versatile model for a range of language tasks.


Model Overview

allknowingroger/Gemma2Slerp1-2.6B is a 2.6 billion parameter language model developed by allknowingroger. This model was created using the SLERP (Spherical Linear Interpolation) merge method, combining the capabilities of two distinct base models:

  • Lil-R/2_PRYMMAL-ECE-2B-SLERP-V2
  • zake7749/gemma-2-2b-it-chinese-kyara-dpo
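
For orientation, below is a minimal usage sketch with the Hugging Face transformers library. It assumes the model resolves on the Hub under this repository id and that the installed transformers version supports the Gemma 2 architecture; the prompt is illustrative.

```python
# Minimal sketch: load the merged model and generate text.
# Assumes the repo id resolves on the Hugging Face Hub and that the
# installed transformers version supports Gemma 2.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "allknowingroger/Gemma2Slerp1-2.6B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the BF16 dtype used for the merge
    device_map="auto",
)

prompt = "Briefly explain what a model merge is."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```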

Merge Details

The SLERP merge method was applied with a specific configuration to blend the characteristics of the base models. The base_model for the merge was Lil-R/2_PRYMMAL-ECE-2B-SLERP-V2, and the merge was performed in bfloat16. The parameters configuration suggests a V-shaped curve for the interpolation factor, so the relative weight given to each model varies across the network's layers rather than staying constant.
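
To make the method concrete, here is an illustrative sketch of spherical linear interpolation applied to a pair of weight tensors, together with a hypothetical V-shaped per-layer schedule for the interpolation factor t. This sketches the underlying math only; it is not the mergekit implementation, and the exact t values used for this model are not reproduced here.

```python
# Illustrative SLERP between two weight tensors. Math sketch only; not
# the actual mergekit implementation or this model's exact configuration.
import torch

def slerp(w_a: torch.Tensor, w_b: torch.Tensor, t: float, eps: float = 1e-8) -> torch.Tensor:
    """Spherically interpolate between w_a (returned at t=0) and w_b (at t=1)."""
    a, b = w_a.flatten().float(), w_b.flatten().float()
    # Angle between the two weight vectors.
    cos_omega = torch.clamp((a @ b) / (a.norm() * b.norm() + eps), -1.0, 1.0)
    omega = torch.arccos(cos_omega)
    sin_omega = torch.sin(omega)
    if sin_omega.abs() < eps:
        # Nearly colinear vectors: fall back to linear interpolation.
        mixed = (1.0 - t) * a + t * b
    else:
        mixed = (
            torch.sin((1.0 - t) * omega) / sin_omega * a
            + torch.sin(t * omega) / sin_omega * b
        )
    return mixed.reshape(w_a.shape).to(w_a.dtype)

# Hypothetical V-shaped schedule: t = 1 (second model) at the outer
# layers, dipping to t = 0 (base model) in the middle layers.
num_layers = 26  # Gemma 2 2B has 26 decoder layers
t_schedule = [abs(1.0 - 2.0 * i / (num_layers - 1)) for i in range(num_layers)]
```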

Key Characteristics

This merged model inherits properties from its constituents: zake7749/gemma-2-2b-it-chinese-kyara-dpo is an instruction-tuned Gemma 2 variant (the "it" suffix denotes instruction tuning) further adapted for Chinese, with "dpo" suggesting preference optimization, while Lil-R/2_PRYMMAL-ECE-2B-SLERP-V2, itself a SLERP merge judging by its name, likely contributes general language understanding. The merging process aims to combine these specialized strengths into a more robust and versatile model.

Potential Use Cases

Given its merged nature, this model may suit applications that call for general language capabilities combined with instruction following and, potentially, Chinese-language tasks, depending on which characteristics it inherits most strongly from its base models.
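
As a sketch of that instruction-following, Chinese-capable angle, the example below prompts the model through a chat template. It assumes the merged model inherits a usable chat template from its instruction-tuned base; check tokenizer.chat_template before relying on this.

```python
# Sketch: instruction-style prompting via the chat template. Assumes the
# merged model inherits a chat template from its instruction-tuned base.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "allknowingroger/Gemma2Slerp1-2.6B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# A Chinese instruction ("Summarize the benefits of model merging in one
# sentence.") to exercise the Chinese-tuned side of the merge.
messages = [{"role": "user", "content": "请用一句话总结模型合并的好处。"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=64)
# Decode only the newly generated tokens.
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```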