nbeerbower/Flammen-Trismegistus-7B

Text Generation · Model Size: 7B · Quant: FP8 · Context Length: 4k · Published: Mar 9, 2024 · License: apache-2.0 · Architecture: Transformer · Open Weights

Flammen-Trismegistus-7B by nbeerbower is a 7 billion parameter language model created by merging nbeerbower/flammen3X and teknium/Mistral-Trismegistus-7B using the SLERP method. This model combines the characteristics of its constituent models, offering a versatile base for various natural language processing tasks. Its 4096-token context length supports moderate input sequences, making it suitable for applications requiring a balance of performance and context handling.


Model Overview

Flammen-Trismegistus-7B is a 7 billion parameter language model developed by nbeerbower. It was created through a strategic merge of two pre-existing models: nbeerbower/flammen3X and teknium/Mistral-Trismegistus-7B. This merging process utilized the SLERP (Spherical Linear Interpolation) method, a technique often employed to combine the strengths of different models while maintaining coherence.
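SLERP interpolates between two weight tensors along the great-circle arc of the unit hypersphere rather than along a straight line, which tends to preserve the geometric structure of the weights better than plain averaging. The sketch below is a minimal NumPy illustration of the technique, not the actual mergekit implementation; the function name and the linear-interpolation fallback for near-parallel tensors are standard conventions, assumed here for clarity.

```python
import numpy as np

def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two weight tensors.

    Interpolates along the arc between v0 and v1 with blend factor t
    (t=0 returns v0, t=1 returns v1). Falls back to linear
    interpolation when the tensors are nearly parallel, where the
    spherical formula is numerically unstable.
    """
    v0 = np.asarray(v0, dtype=np.float64)
    v1 = np.asarray(v1, dtype=np.float64)
    # Normalize copies to measure the angle between the two tensors.
    u0 = v0 / (np.linalg.norm(v0) + eps)
    u1 = v1 / (np.linalg.norm(v1) + eps)
    dot = np.clip(np.dot(u0.ravel(), u1.ravel()), -1.0, 1.0)
    if abs(dot) > 1.0 - 1e-6:
        # Nearly parallel: plain linear interpolation is fine.
        return (1 - t) * v0 + t * v1
    theta = np.arccos(dot)        # angle between the tensors
    sin_theta = np.sin(theta)
    return (np.sin((1 - t) * theta) / sin_theta) * v0 \
         + (np.sin(t * theta) / sin_theta) * v1
```

At `t = 0.5` the result lies midway along the arc, which is why SLERP is often described as a "balanced" blend of the two parents.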

Merge Details

The model integrates the full layer ranges (0 to 32) from both nbeerbower/flammen3X and teknium/Mistral-Trismegistus-7B. The SLERP interpolation factor was weighted separately for the self_attn and mlp tensors, so the blend can favor one parent model in attention layers and the other in feed-forward layers. The merge was performed in bfloat16, reducing memory footprint while retaining adequate numerical range.
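Merges like this are typically expressed as a mergekit YAML config. The fragment below is an illustrative sketch: the model names, layer ranges, method, and dtype match the description above, but the `base_model` choice and the per-layer `t` schedules are assumptions, not the actual merge recipe.

```yaml
slices:
  - sources:
      - model: nbeerbower/flammen3X
        layer_range: [0, 32]
      - model: teknium/Mistral-Trismegistus-7B
        layer_range: [0, 32]
merge_method: slerp
base_model: nbeerbower/flammen3X     # assumed; the card does not state the base
parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]   # illustrative per-layer schedule
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]   # illustrative per-layer schedule
    - value: 0.5                     # default blend for all other tensors
dtype: bfloat16
```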

Key Characteristics

  • Merged Architecture: Combines the features of flammen3X and Mistral-Trismegistus-7B.
  • SLERP Method: Interpolates weights spherically rather than linearly, preserving weight geometry during the blend.
  • 7 Billion Parameters: Offers a substantial capacity for complex language understanding and generation.
  • 4096 Token Context: A context window suited to prompts and documents of moderate length.

Potential Use Cases

Given its merged nature, Flammen-Trismegistus-7B is expected to be suitable for a range of general-purpose NLP tasks, including text generation, summarization, and question answering, leveraging the combined strengths of its parent models.