arbazsiddiqui/Ozan-v1-12B
Ozan-v1-12B is a 12 billion parameter creative-writing model developed by arbazsiddiqui, fine-tuned from Mistral-Nemo-Instruct-2407 with a 32768 token context length. It is specifically optimized to produce long-form fiction that avoids common "LLM tells" or "slop," achieving the lowest slop metric among runnable 12B models tested. This model excels at generating high-quality, natural-sounding prose for creative writing applications.
Loading preview...
Ozan-v1-12B: A Low-Slop Creative Writing Model
Ozan-v1-12B, developed by arbazsiddiqui, is a 12 billion parameter model fine-tuned from Mistral-Nemo-Instruct-2407 with a 32K context length. Its core innovation is the deliberate reduction of "slop"—overused, generic phrases common in AI-generated text—making its output sound more human and less like an LLM. This is achieved through a unique training methodology.
Key Capabilities & Training
- Low Slop Generation: Ozan-v1-12B is specifically trained and measured to minimize common LLM tells, achieving the lowest slop metric among comparable 12B models on the EQ-Bench Creative Writing v3 harness.
- Creative Writing Focus: It is designed for long-form fiction, producing prose that is fluent, coherent, and strong in voice and tone.
- Hybrid Training: The model underwent Supervised Fine-Tuning (SFT) using QLoRA on ~17k curated, low-slop examples from non-GPT models and human writing prompts. This was followed by Gutenberg anti-slop DPO (Direct Preference Optimization), which preference-tuned the model towards real public-domain book prose.
- Performance: While excelling in slop reduction (5.30 slop, 3.21 repetition), it also shows competitive craft scores (58.8/100) against stronger low-slop 12B finetunes, significantly improving over its base model.
Use Cases
- Long-form Fiction: Ideal for generating stories, chapters, or other extensive creative narratives.
- Character Dialogue: Capable of producing distinct character voices, as demonstrated in provided samples.
- Avoiding AI-isms: Suitable for applications where the goal is to produce text that is indistinguishable from human writing, particularly in creative contexts.
Technical Details
- Base Model: Mistral-Nemo-Instruct-2407.
- Quantization: Available in GGUF quants (Q4_K_M, Q5_K_M, Q6_K, Q8_0) with importance matrix (imatrix) for optimal quality.
- Recommended Sampler Settings: Uses
temperature 0.7,min_p 0.1, and DRY (Dynamic Repetition control) for clean, long outputs.