crestf411/MN-Slush
TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Nov 20, 2024Architecture:Transformer0.0K Warm
crestf411/MN-Slush is a 12 billion parameter, two-stage fine-tuned language model based on Mistral-Nemo-Base-2407, developed by crestf411. It is specifically optimized to enhance creativity, writing capabilities, and roleplaying performance through a unique LoRA dropout training methodology. The model leverages a continued pretraining stage to boost creative output, followed by a fine-tuning stage to refine instruction adherence and roleplaying, making it suitable for generative text applications requiring imaginative and interactive responses.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
–
frequency_penalty
presence_penalty
repetition_penalty
min_p