Kukedlc/NeuralKuke-4-All-7b

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Mar 30, 2024License:apache-2.0Architecture:Transformer Open Weights Cold

Kukedlc/NeuralKuke-4-All-7b is a 7 billion parameter language model merged from several specialized Kukedlc models, including those focused on ARC, Wino, QA, and Maths tasks. This model leverages a DARE TIES merge method to combine diverse capabilities, aiming for a broad range of general-purpose applications. With an 8192 token context length, it is designed to handle complex queries requiring robust reasoning and factual recall across multiple domains.

Loading preview...

NeuralKuke-4-All-7b: A Merged 7B Model for Diverse Tasks

NeuralKuke-4-All-7b is a 7 billion parameter language model developed by Kukedlc, created by merging five distinct specialized models using the LazyMergekit with a dare_ties merge method. This approach combines the strengths of its constituent models to offer a versatile solution.

Key Capabilities

  • Broad Task Coverage: Integrates capabilities from models specifically trained for:
    • ARC (AI2 Reasoning Challenge): Enhances reasoning and common-sense understanding.
    • Wino (Winograd Schema Challenge): Improves pronoun resolution and contextual understanding.
    • QA (Question Answering): Boosts factual recall and information extraction.
    • Maths: Strengthens mathematical problem-solving abilities.
  • Merged Architecture: Built upon Kukedlc/NeuralSirKrishna-7b as the base model, with weighted contributions from the specialized Neural-4 models.
  • Configuration: Utilizes float16 dtype and includes int8_mask for potential optimization.

Good For

  • General-purpose applications requiring a blend of reasoning, factual knowledge, and mathematical skills.
  • Developers looking for a 7B model with a wide array of pre-integrated specialized capabilities, reducing the need for multiple fine-tuned models.
  • Experimentation with merged models and understanding the dare_ties method's impact on performance across diverse tasks.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p