crestf411/L3.1-8B-Slush-v1.1

Warm
Public
8B
FP8
32768
License: llama3
Hugging Face
Overview

Overview

crestf411's L3.1-8B-Slush-v1.1 is an 8 billion parameter model built upon the Llama 3.1 architecture, designed to address the base model's limitations in creativity and imagination. It employs a unique two-stage training methodology:

  • Stage 1 (Pretraining Continuation): Focuses on boosting the model's creativity and writing abilities through high LoRA dropout, then merging this into the instruction-tuned base.
  • Stage 2 (Fine-tuning): Further refines roleplaying capabilities and mitigates any potential degradation from the initial merge.

This model incorporates a custom merge using MergeKit with the TIES method, targeting the meta-llama/Llama-3.1-8B as its base. Training parameters were adjusted in v1.1 based on feedback, including specific LoRA configurations (rank 64, alpha 128 for stage 1; rank 32, alpha 64 for stage 2) and the use of LoRA+.

Key Capabilities

  • Enhanced Creativity: Aims to improve imaginative text generation.
  • Improved Writing: Designed for better overall writing quality.
  • Strong Roleplaying: Specifically fine-tuned to excel in interactive roleplay scenarios, following the Silly Tavern preset.

Good For

  • Applications requiring creative text generation.
  • Interactive storytelling and roleplaying.
  • Use cases where imagination and nuanced responses are critical.