diffnamehard/Mistral-CatMacaroni-slerp-uncensored-7B

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Dec 27, 2023License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

diffnamehard/Mistral-CatMacaroni-slerp-uncensored-7B is an experimental 7 billion parameter language model, fine-tuned from Mistral-CatMacaroni-slerp-7B. It was trained on the toxic-dpo-v0.1-NoWarning-alpaca dataset, focusing on uncensored responses. The model demonstrates general language understanding capabilities with a context length of 8192 tokens, achieving notable scores across various benchmarks including HellaSwag and Winogrande.

Loading preview...

Overview

This is an experimental 7 billion parameter language model developed by diffnamehard, fine-tuned from the Mistral-CatMacaroni-slerp-7B base model. Its primary characteristic is its training on the toxic-dpo-v0.1-NoWarning-alpaca dataset, which aims to produce uncensored outputs.

Key Capabilities

  • General Language Understanding: The model exhibits capabilities across a range of common language understanding tasks.
  • Uncensored Responses: Fine-tuning on a specific dataset suggests an emphasis on generating responses without typical content filters.
  • Benchmark Performance: Achieves competitive scores on several academic benchmarks, including:
    • ARC (25-shot): 64.25
    • HellaSwag (10-shot): 84.09
    • MMLU (5-shot): 62.66
    • TruthfulQA (0-shot): 56.87
    • Winogrande (5-shot): 79.72
    • GSM8K (5-shot): 56.1

Good For

  • Experimental Use Cases: Ideal for researchers and developers exploring the behavior of models fine-tuned on specific, less-filtered datasets.
  • Comparative Analysis: Useful for comparing performance and response characteristics against other models, particularly those with standard safety alignments.
  • Applications Requiring Directness: Potentially suitable for applications where direct, unfiltered language generation is a requirement, provided ethical considerations are thoroughly addressed.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p