Name: OccultAI/Musecuilo-12B-Model_Stock API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: OccultAI

Musecuilo-12B-Model_Stock Overview

OccultAI's Musecuilo-12B-Model_Stock is a 12 billion parameter language model built upon the MistralForCausalLM architecture, featuring a substantial 32768 token context length. This model was developed using the model_stock merge method, combining three distinct base models: mistralai/Mistral-Nemo-Instruct-2407, LatitudeGames/Muse-12B, and allura-org/Tlacuilo-12B.

Key Characteristics

Merge Method: Utilizes the model_stock merge technique, which is designed to integrate the strengths of multiple models.
Base Models: Merges Mistral-Nemo-Instruct-2407, Muse-12B, and Tlacuilo-12B.
Recommended Templates: Optimized for use with Mistral Tekken or ChatML chat templates to achieve the best results.
Configuration: The merge configuration specifies filter_wise: true and uses bfloat16 for the output data type.

Usage Considerations

While Musecuilo-12B-Model_Stock is a capable model, users should be aware that it may exhibit some refusals. The README suggests that these can potentially be addressed through jailbreaking or ablation techniques, depending on the specific application requirements.

Overview

Musecuilo-12B-Model_Stock Overview

Key Characteristics

Usage Considerations

Full Model Card (README)