makotonlo/LLM2026_DPO_SFT19_v18

TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Mar 1, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The makotonlo/LLM2026_DPO_SFT19_v18, also known as Silent Expert v18, is a 7.6 billion parameter Qwen2.5-7B fine-tune with a 32768 token context length. It is specifically optimized through Direct Preference Optimization (DPO) to produce pure, raw data output without conversational filler or formatting noise. This model excels at generating immediate, structured content, making it ideal for automated data processing pipelines.

Loading preview...

LLM2026_DPO_SFT19_v18: The Silent Expert for Pure Data Output

This model, developed by makotonlo, is a specialized fine-tune of the Qwen2.5-7B architecture, featuring 7.6 billion parameters and a 32768 token context length. Named "Silent Expert v18," its core innovation lies in its training via Direct Preference Optimization (DPO) to deliver pure data output by suppressing all conversational elements and formatting noise. This makes it uniquely suited for applications requiring clean, structured data directly from the model.

Key Capabilities

  • Absolute Silence: Eliminates common conversational preambles and concluding remarks, ensuring output starts immediately with relevant data.
  • Zero Formatting Noise: Specifically trained to avoid Markdown backticks (e.g., ```json) that can complicate automated parsing.
  • Raw Data Focus: Designed to output immediate, structured content from the very first character, streamlining data extraction workflows.
  • Hybrid Delivery: Available as both a 16-bit physically merged model for standalone use and a LoRA adapter for integration into PEFT workflows, offering flexibility in deployment.

Good For

  • Automated data processing pipelines where clean, unadulterated data is critical.
  • Applications requiring structured output without any conversational or formatting overhead.
  • Developers looking for a model that minimizes post-processing needs for its output.