aadityabuilds/qwen2-5-coder-7b-kernelbook-sdft

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:May 27, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The aadityabuilds/qwen2-5-coder-7b-kernelbook-sdft is a 7.6 billion parameter Qwen2.5-Coder-7B-Instruct model fine-tuned by aadityabuilds using Self-Distillation Fine-Tuning (SDFT). It is specifically optimized for generating Triton GPU kernels from PyTorch module descriptions. This model excels at converting PyTorch code into Triton implementations, making it highly specialized for hardware-accelerated kernel development.

Loading preview...

Model Overview

This model, aadityabuilds/qwen2-5-coder-7b-kernelbook-sdft, is a specialized checkpoint of the Qwen2.5-Coder-7B-Instruct model, fine-tuned using Self-Distillation Fine-Tuning (SDFT). It has been post-trained on the KernelBook dataset, which consists of PyTorch module prompts paired with reference Triton kernels.

Key Capabilities

  • Triton Kernel Generation: Its primary function is to generate Triton GPU kernels directly from PyTorch-style module descriptions.
  • SDFT Training: Utilizes a unique self-distillation fine-tuning method where the model learns to reproduce reference Triton implementations by seeing the user prompt alongside privileged context.
  • Specialized Dataset: Trained on the KernelBook dataset, ensuring high relevance and accuracy for Triton kernel conversion tasks.

Intended Use Cases

This model is best suited for:

  • Converting PyTorch to Triton: Ideal for developers looking to translate PyTorch module definitions into optimized Triton kernels for GPU acceleration.
  • Hardware-Accelerated Kernel Development: Facilitates the creation of high-performance kernels by automating the conversion process.

Limitations

Due to its highly specialized training, this model may exhibit reduced performance on general coding, mathematical, or knowledge-based tasks compared to its base instruct model. It is not intended as a general-purpose chat or reasoning model.