Steelskull/L3.3-Mokume-Gane-R1-70b-v1.1

Hugging Face
TEXT GENERATIONConcurrency Cost:4Model Size:70BQuant:FP8Ctx Length:32kPublished:Mar 2, 2025License:llama3.3Architecture:Transformer0.0K Warm

Steelskull/L3.3-Mokume-Gane-R1-70b-v1.1 is a 70 billion parameter language model developed by Steelskull, built on the DS-Hydroblated-R1 foundation using the SCE merge method. This model is part of an experimental series, specifically designed to emphasize creative and unexpected outputs, integrating components for enhanced reasoning, coherence, and detailed scene descriptions. It excels in creative expression and character adherence, offering distinctive outputs for users seeking innovative AI-generated content.

Loading preview...

Model Overview

L3.3-Mokume-Gane-R1-70b-v1.1 is a 70 billion parameter language model developed by Steelskull, named after the Japanese metalworking technique 'Mokume-gane' to reflect its layered composition and unique output. It is built upon the custom DS-Hydroblated-R1 base model and utilizes the SCE (Select, Calculate, and Erase) merge method, integrating components from various high-performance models including EVA-LLaMA-3.33-v0.0 for core capabilities, Euryale-v2.3 for enhanced reasoning, Cirrus-x1 and Hanami-x1 for coherence and balanced responses, Anubis-v1 for detailed descriptions, and Negative_LLAMA for bias reduction.

Key Capabilities

  • Exceptional Creativity: Designed to generate unique and unexpected outputs, making it stand out in creative tasks.
  • Enhanced Reasoning: Features improved reasoning capabilities, particularly when guided by structured prompting with clear logical frameworks.
  • Strong Character Adherence: Excels in maintaining consistent character traits and natural dialogue flow.
  • Detailed Scene Descriptions: Incorporates components specifically for generating rich and elaborate scene details.
  • Bias Reduction: Integrates Negative_LLAMA to help maintain perspective and reduce potential biases.

Good For

  • Creative Content Generation: Ideal for applications requiring highly imaginative and novel text outputs.
  • Roleplay and Storytelling: Its strong character adherence and ability to generate detailed scenes make it suitable for interactive narratives.
  • Exploratory AI Research: Useful for developers and researchers interested in models that push the boundaries of creative expression.

Considerations

While highly creative, the model's outputs can be variable and may require careful prompt engineering and sampler tuning (e.g., recommended static temperature of 1-1.05 and Min P of 0.03) to achieve optimal and coherent results. It benefits significantly from structured reasoning prompts, such as the LeCeption XML template, to unlock deeper analytical responses.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p