Name: Sakalti/ultiima-14B-v0.2 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Sakalti

Model Overview

Sakalti/ultiima-14B-v0.2 is a 14.8 billion parameter language model, notable for its creation using the TIES (Trimmed, Iterative, and Self-Referential) merge method. This approach combines the weights of multiple pre-trained models to synthesize a new model, aiming to leverage their collective strengths.

Merge Details

This model was constructed by merging sometimesanotion/Lamarck-14B-v0.6 into sometimesanotion/Lamarck-14B-v0.7, which served as the base model. The TIES method, as described in the TIES paper, was applied with specific parameters including a weight and density of 1 for both the merged and base models, with normalization and int8 masking enabled. The model operates with a float16 data type.

Key Characteristics

Architecture: Based on the Lamarck-14B series, indicating a foundation designed for general language understanding and generation tasks.
Parameter Count: 14.8 billion parameters, placing it in the medium-large scale category for language models.
Context Length: Supports a substantial context window of 32768 tokens, allowing for processing and generating longer sequences of text.
Development Method: Utilizes the TIES merging technique, suggesting an emphasis on combining and refining existing model capabilities rather than training from scratch.

Potential Use Cases

Given its merged nature and substantial context length, ultiima-14B-v0.2 could be suitable for applications requiring:

Enhanced general-purpose text generation: Benefiting from the combined knowledge of its constituent models.
Long-form content understanding: Leveraging its 32768 token context for tasks like summarization or detailed analysis of extensive documents.
Exploration of merged model performance: Ideal for researchers and developers interested in the practical application and performance characteristics of models created via advanced merging techniques like TIES.

Overview

Model Overview

Merge Details

Key Characteristics

Potential Use Cases

Full Model Card (README)