JayZenith/GLYPH_SFT

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 20, 2026Architecture:Transformer Warm

GLYPH_SFT is a 4 billion parameter instruction-tuned causal language model developed by JayZenith, based on Qwen3-4B-Base. It is specifically fine-tuned for generating rigid, GLYPH-style traces, demonstrating high accuracy on a formal evaluation suite. This model excels at providing concise, structured responses, making it suitable for applications requiring precise output formatting.

Loading preview...

GLYPH_SFT Overview

JayZenith's GLYPH_SFT is a 4 billion parameter language model, a full fine-tune of Qwen/Qwen3-4B-Base. Its primary distinction lies in its specialized training for generating rigid, GLYPH-style traces, which are structured, formal outputs. The model was fine-tuned using the JayZenith/GLYPH_SFT_DATASET.

Key Capabilities

  • Structured Output Generation: Designed to produce highly formatted, GLYPH-style responses, as demonstrated by its example output structure including plan, act, and response blocks.
  • High Accuracy: Achieved 96/100 on a 100-prompt formal evaluation suite, with no exact user-prompt overlaps in the training data, indicating strong generalization.
  • Concise Explanations: Excels at providing compact and conceptual explanations, as shown in its Rust lifetime example.
  • Improved Perplexity: Significantly reduced held-out perplexity from 9.44 to 1.39, suggesting better language modeling and coherence compared to its base model.

Good For

  • Applications requiring formal, structured outputs: Ideal for systems where responses need to adhere to a specific, predefined format or schema.
  • Code explanation and assistance: Particularly effective for tasks like explaining programming concepts concisely, as illustrated with Rust examples.
  • Automated content generation with strict formatting rules: Useful in scenarios where free-form text generation is undesirable and precise output control is paramount.

This model represents the Supervised Fine-Tuning (SFT) checkpoint intended for further development in Reinforcement Learning from Human Feedback (RLHF).