lodrick-the-lafted/Kudzu-8B

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kLicense:apache-2.0Architecture:Transformer0.0K Open Weights Warm

Kudzu-8B is an 8 billion parameter merged language model developed by lodrick-the-lafted, combining several Llama-3 based models including Olethros-8B, Limon-8B, Rummage-8B, and others. This model is designed to offer strong intelligence while mitigating common Llama-3 conversational quirks, making it suitable for diverse generative AI applications. It leverages a context length of 8192 tokens, providing ample capacity for complex prompts and responses.

Loading preview...

Kudzu-8B Overview

Kudzu-8B is an 8 billion parameter language model created by lodrick-the-lafted through a merge using the mergekit-evolve framework. This model is a composite of several specialized 8B models, including:

  • lodrick-the-lafted/Olethros-8B
  • lodrick-the-lafted/Limon-8B
  • lodrick-the-lafted/Rummage-8B
  • Edgerunners/meta-llama-3-8b-instruct-hf-ortho-baukit-10fail-1000total
  • cgato/L3-TheSpice-8b-v0.8.3

Key Characteristics

The merging process utilized wmdp as the scoring method for evolve. A notable characteristic of Kudzu-8B, based on limited testing, is its ability to retain a significant portion of the base Llama-3 intelligence while reportedly reducing the frequency of typical Llama-3 conversational interjections like "Ahaha!". The model's composition includes several ablated models, which means it is designed to be direct in its responses.

Potential Use Cases

Kudzu-8B is well-suited for applications requiring a capable 8B model that can generate intelligent and coherent text without the specific conversational patterns sometimes observed in its Llama-3 lineage. Its design suggests it aims for directness and efficiency in fulfilling user requests, making it a strong candidate for general-purpose text generation, summarization, and question-answering where a less verbose or idiosyncratic output is preferred.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p