PS4Research/nB8hY3fD6sQ1cX5w
TEXT GENERATIONConcurrency Cost:2Model Size:24BQuant:FP8Ctx Length:32kPublished:May 11, 2026License:apache-2.0Architecture:Transformer Open Weights Warm
PS4Research/nB8hY3fD6sQ1cX5w is a 24 billion parameter Mistral-based model developed by PS4Research, fine-tuned from unsloth/Magistral-Small-2506-unsloth-bnb-4bit. This model was trained with Unsloth and Huggingface's TRL library, emphasizing faster training efficiency. It offers a 32768 token context length, making it suitable for applications requiring extensive context processing.
Loading preview...
Model Overview
PS4Research/nB8hY3fD6sQ1cX5w is a 24 billion parameter language model developed by PS4Research. It is fine-tuned from the unsloth/Magistral-Small-2506-unsloth-bnb-4bit model, leveraging the Mistral architecture.
Key Characteristics
- Architecture: Based on the Mistral family of models.
- Parameter Count: 24 billion parameters, offering a balance between performance and computational requirements.
- Context Length: Supports a substantial context window of 32768 tokens, enabling it to process and generate longer sequences of text.
- Training Efficiency: This model was fine-tuned using Unsloth and Huggingface's TRL library, which is noted for enabling significantly faster training times (up to 2x faster).
Potential Use Cases
Given its large context window and efficient training methodology, this model is potentially well-suited for:
- Applications requiring processing of long documents or conversations.
- Tasks benefiting from a robust understanding of extensive contextual information.
- Scenarios where efficient fine-tuning is a priority for adaptation to specific datasets.