normster/RealGuardrails-Qwen2.5-7B-SFT-DPO
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Feb 17, 2025License:mitArchitecture:Transformer Open Weights Cold

The normster/RealGuardrails-Qwen2.5-7B-SFT-DPO model is a 7.6 billion parameter language model based on the Qwen2.5 architecture, featuring a 32768 token context length. Developed by normster, this model is specifically fine-tuned using Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) on the RealGuardrails dataset. Its primary differentiator is an enhanced ability to adhere to system prompts and maintain precedence, making it highly effective for applications requiring strict guardrail enforcement.

Loading preview...