thesantatitan/qwen3-0.6B-svg-sft

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:May 22, 2025Architecture:Transformer Warm

The thesantatitan/qwen3-0.6B-svg-sft model is a fine-tuned variant of the Qwen3-0.6B architecture, developed by thesantatitan. With 0.8 billion parameters and a 40960-token context length, this model specializes in text-to-SVG generation. It has been specifically trained using Supervised Fine-Tuning (SFT) on the text2svg-stack-follow-constraints dataset, making it adept at converting natural language descriptions into Scalable Vector Graphics (SVG) code.

Loading preview...

Model Overview

The thesantatitan/qwen3-0.6B-svg-sft is a specialized language model derived from the Qwen3-0.6B architecture. Developed by thesantatitan, this model has been fine-tuned using Supervised Fine-Tuning (SFT) with the TRL framework.

Key Capabilities

  • Text-to-SVG Generation: The primary function of this model is to translate natural language prompts into Scalable Vector Graphics (SVG) code. It was trained on the thesantatitan/text2svg-stack-follow-constraints dataset, which focuses on generating SVG based on textual descriptions and constraints.
  • Qwen3-0.6B Base: Leverages the foundational capabilities of the Qwen3-0.6B model, providing a robust base for its specialized task.

Training Details

The model underwent training using the TRL (Transformer Reinforcement Learning) library, specifically employing an SFT approach. This method involved fine-tuning the base Qwen3-0.6B model on a dataset designed to teach it the intricate mapping between text and SVG syntax, including adherence to specified constraints.

Good For

  • Automated SVG Creation: Ideal for applications requiring the programmatic generation of SVG images from textual input.
  • Prototyping and Design: Can assist designers and developers in quickly generating vector graphics based on descriptive text, streamlining the initial stages of design or UI development.
  • Educational Tools: Potentially useful in educational contexts to demonstrate how textual commands can translate into visual representations.