FinaPolat/RAISED_Mistral-Nemo_GRPO

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:May 31, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

FinaPolat/RAISED_Mistral-Nemo_GRPO is a 12 billion parameter Mistral-based language model developed by FinaPolat, featuring a 32768 token context length. This model was fine-tuned using the Unsloth library and Huggingface's TRL, enabling a 2x faster training process. It is a fine-tuned iteration of FinaPolat/RAISED_Mistral-Nemo_SFT, optimized for efficient development workflows.

Loading preview...

Model Overview

FinaPolat/RAISED_Mistral-Nemo_GRPO is a 12 billion parameter language model built upon the Mistral architecture, developed by FinaPolat. It boasts a substantial context length of 32768 tokens, making it suitable for processing longer sequences of text.

Key Characteristics

  • Architecture: Based on the Mistral model family.
  • Parameter Count: 12 billion parameters.
  • Context Length: Supports up to 32768 tokens.
  • Training Efficiency: Fine-tuned using the Unsloth library and Huggingface's TRL, which facilitated a 2x faster training process compared to standard methods.
  • Lineage: This model is a further fine-tuned version of FinaPolat/RAISED_Mistral-Nemo_SFT.

Intended Use

This model is designed for developers seeking a Mistral-based LLM that benefits from accelerated fine-tuning techniques. Its efficient training methodology makes it a practical choice for projects requiring rapid iteration and deployment of custom models.