BlouseJury/Mistral-7B-Discord-0.2

  • Task: Text Generation
  • Concurrency Cost: 1
  • Model Size: 7B
  • Quant: FP8
  • Ctx Length: 8k
  • Published: Jan 28, 2024
  • License: apache-2.0
  • Architecture: Transformer
  • Open Weights
  • Status: Cold

BlouseJury/Mistral-7B-Discord-0.2 is a 7 billion parameter language model, fine-tuned from Mistral-7B-v0.1 on approximately 40 million tokens of anonymized Discord messages. This base model is specifically optimized for generating and understanding conversational text in a Discord-like style, making it suitable for applications requiring informal, chat-based interactions. It achieves an average score of 59.55 on the Open LLM Leaderboard, with notable performance on HellaSwag (82.49) and MMLU (62.82).


Model Overview

BlouseJury/Mistral-7B-Discord-0.2 is a 7-billion parameter language model developed by BlouseJury. It is a fine-tuned version of the original Mistral-7B-v0.1 model, specifically adapted for conversational contexts. The model underwent training for 4 epochs on a dataset comprising approximately 40 million tokens of anonymized, largely unformatted Discord messages.

Key Characteristics

  • Base Model: Fine-tuned from Mistral-7B-v0.1.
  • Training Data: Specialized on a large corpus of Discord messages, aiming to capture the nuances of informal chat.
  • Parameter Count: 7 billion parameters, offering a balance between performance and computational efficiency.
  • Context Length: Supports an 8192-token context window.
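
Because the context window is fixed at 8192 tokens, a chat application has to decide which earlier messages to drop once a conversation grows. The sketch below keeps the most recent messages that fit within the window while reserving room for the reply. The `trim_history` helper, the `reserve_for_reply` default, and the whitespace-based `approx_token_count` are all illustrative assumptions; a real deployment would count tokens with the model's own tokenizer.

```python
from typing import Callable, List

CONTEXT_WINDOW = 8192  # context length stated on the model card


def approx_token_count(text: str) -> int:
    """Crude stand-in for a tokenizer (assumption): counts whitespace-separated words."""
    return len(text.split())


def trim_history(
    messages: List[str],
    reserve_for_reply: int = 256,  # hypothetical budget kept free for generation
    count_tokens: Callable[[str], int] = approx_token_count,
) -> List[str]:
    """Return the most recent messages whose combined size fits the prompt budget."""
    budget = CONTEXT_WINDOW - reserve_for_reply
    kept: List[str] = []
    used = 0
    # Walk from newest to oldest so the most recent context survives.
    for message in reversed(messages):
        cost = count_tokens(message)
        if used + cost > budget:
            break
        kept.append(message)
        used += cost
    kept.reverse()  # restore chronological order
    return kept
```

Passing a real tokenizer's counting function as `count_tokens` would make the budget exact; the word-count fallback generally undercounts tokens, so a larger `reserve_for_reply` is the safer choice when using it.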

Performance Benchmarks

On the Open LLM Leaderboard, BlouseJury/Mistral-7B-Discord-0.2 has an overall average score of 59.55. Individual benchmark scores include:

  • HellaSwag (10-Shot): 82.49
  • MMLU (5-Shot): 62.82
  • AI2 Reasoning Challenge (25-Shot): 60.58
  • Winogrande (5-Shot): 77.74
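
The four scores listed here do not average to the reported 59.55, so the overall figure must also cover benchmarks that are not listed. The short calculation below makes that explicit. It assumes the leaderboard average is an unweighted mean over six benchmarks; that count is not stated in this card, and the reported average is rounded, so the implied figure is approximate.

```python
# Scores as listed on the model card.
listed = {
    "HellaSwag (10-shot)": 82.49,
    "MMLU (5-shot)": 62.82,
    "AI2 Reasoning Challenge (25-shot)": 60.58,
    "Winogrande (5-shot)": 77.74,
}
reported_average = 59.55

# Assumption: the overall average is an unweighted mean over six benchmarks.
ASSUMED_BENCHMARK_COUNT = 6

listed_total = sum(listed.values())
listed_mean = listed_total / len(listed)

# Whatever the unlisted benchmarks scored, their total must make up the gap.
remaining_count = ASSUMED_BENCHMARK_COUNT - len(listed)
remaining_total = reported_average * ASSUMED_BENCHMARK_COUNT - listed_total
remaining_mean = remaining_total / remaining_count

print(f"Mean of the listed scores: {listed_mean:.2f}")
print(f"Implied mean of the {remaining_count} unlisted scores: {remaining_mean:.2f}")
```

Under that assumption, the listed benchmarks average roughly 70.9, while the unlisted ones would have to average in the mid-30s to bring the overall score down to 59.55.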

Ideal Use Cases

This model is particularly well-suited for applications that require generating or understanding text in an informal, chat-like style, such as:

  • Discord Bot Development: Creating bots that can interact naturally within Discord channels.
  • Conversational AI: Building chatbots for informal communication platforms.
  • Social Media Content Generation: Producing text that mimics casual online discourse.
  • Text Analysis: Understanding sentiment or context in user-generated content from chat platforms.

Popular Sampler Settings

Featherless tracks the top 3 sampler parameter combinations its users apply to this model. Each configuration sets the following parameters:

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p
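
As a rough illustration of how the sampler parameters named above might be passed to a text-completion endpoint, the sketch below assembles a JSON request body. Every numeric value is a hypothetical placeholder rather than one of the popular configurations, and the request shape (an OpenAI-style completion body with extra sampler fields) is an assumption; whether a given server accepts every field should be checked against its documentation. `build_request` is an illustrative helper, not part of any library.

```python
import json
from typing import Any, Dict

MODEL_ID = "BlouseJury/Mistral-7B-Discord-0.2"

# Hypothetical placeholder values (assumption), NOT the settings shown on the page.
EXAMPLE_SAMPLER: Dict[str, float] = {
    "temperature": 0.8,
    "top_p": 0.95,
    "top_k": 40,
    "frequency_penalty": 0.0,
    "presence_penalty": 0.0,
    "repetition_penalty": 1.1,
    "min_p": 0.05,
}


def build_request(prompt: str, max_tokens: int = 128, **overrides: float) -> Dict[str, Any]:
    """Merge sampler overrides into the defaults and return a request body."""
    unknown = set(overrides) - set(EXAMPLE_SAMPLER)
    if unknown:
        # Reject misspelled names early instead of sending them silently.
        raise ValueError(f"unknown sampler parameters: {sorted(unknown)}")
    sampler = {**EXAMPLE_SAMPLER, **overrides}
    return {"model": MODEL_ID, "prompt": prompt, "max_tokens": max_tokens, **sampler}


if __name__ == "__main__":
    body = build_request("anyone up for a game tonight?", temperature=0.7)
    print(json.dumps(body, indent=2))
```

Since this is a base model trained on largely unformatted chat text, a plain-text prompt like the one above is used instead of a structured chat template; that choice is also an assumption.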