sangwoonoel/fixed-model
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Mar 28, 2026Architecture:Transformer Warm
sangwoonoel/fixed-model is an 8 billion parameter Qwen3 model, based on the Qwen/Qwen3-8B architecture, with a 32K context length. This model has been specifically fixed to include the necessary chat_template in its tokenizer_config.json, enabling proper message formatting for chat-based applications. It is optimized for use with the /chat/completions API, ensuring correct ChatML template adherence.
Loading preview...
Overview
This model, sangwoonoel/fixed-model, is an 8 billion parameter Qwen3 architecture, specifically identified as Qwen3-8B through inspection of its weight map and configuration files. It features a 32K token context length.
Key Fixes and Characteristics
- Root Cause: The original
yunmorning/broken-modelwas missing thechat_templatefield in itstokenizer_config.json. - Solution: The
chat_templatefield, essential for the/chat/completionsAPI, has been added, adopting the official ChatML template expected by Qwen3 models. - Verification: The model's identity as Qwen3-8B was confirmed by analyzing
self_attn.q_norm/self_attn.k_normlayers, 36 hidden layers, and a vocab_size of 151936. Itsconfig.jsonandgeneration_config.jsonare identical to the official Qwen/Qwen3-8B. - Metadata Note: While the HuggingFace metadata incorrectly lists
base_model: meta-llama/Meta-Llama-3.1-8B, this is a display-only inaccuracy and does not affect the model's functionality or the fix implemented.
Good For
- Applications requiring reliable
chat/completionsAPI functionality with Qwen3 models. - Developers needing a Qwen3-8B model with correctly configured ChatML message formatting.
- Use cases where adherence to the
<|im_start|>/<|im_end|>token structure is critical for chat interactions.