lkongam/KernelCoder
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Sep 15, 2025Architecture:Transformer0.0K Cold

KernelCoder is a 32.8 billion parameter model developed by lkongam, specifically trained on a curated dataset of reasoning traces and CUDA kernel pairs. This model is designed for code generation, particularly excelling at generating CUDA kernels. Its specialized training makes it highly effective for tasks requiring the creation of optimized parallel computing code.

Loading preview...