FinaPolat/RAISED_Mistral-Nemo_DPO_1Krandom

TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Jun 1, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

FinaPolat/RAISED_Mistral-Nemo_DPO_1Krandom is a 12 billion parameter Mistral-based language model developed by FinaPolat. This model was fine-tuned using Unsloth and Huggingface's TRL library, resulting in a 2x faster training process. It is optimized for tasks benefiting from efficient fine-tuning and builds upon the FinaPolat/RAISED_Mistral-Nemo_SFT model.

Loading preview...

Overview

FinaPolat/RAISED_Mistral-Nemo_DPO_1Krandom is a 12 billion parameter language model developed by FinaPolat. It is based on the Mistral architecture and was fine-tuned from the FinaPolat/RAISED_Mistral-Nemo_SFT model. A key characteristic of this model's development is its training efficiency, having been fine-tuned 2x faster using the Unsloth library in conjunction with Huggingface's TRL library.

Key Capabilities

  • Efficient Fine-tuning: Benefits from a training process that was twice as fast due to the use of Unsloth and Huggingface's TRL library.
  • Mistral-based Architecture: Leverages the established capabilities of the Mistral model family.
  • DPO Fine-tuning: Implies a focus on alignment and preference learning, building on its SFT predecessor.

Good For

  • Developers seeking a Mistral-based model that has undergone an optimized and faster fine-tuning process.
  • Applications where the base Mistral capabilities are desired, potentially with improved alignment from DPO.
  • Experimentation with models fine-tuned using Unsloth for efficiency.