ahmedheakl/cass-sm4090-3b
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kPublished:May 10, 2025License:otherArchitecture:Transformer Warm

The ahmedheakl/cass-sm4090-3b model is a 3.1 billion parameter instruction-tuned causal language model, fine-tuned from Qwen/Qwen2.5-Coder-3B-Instruct. It was trained on specific CUDA and AMD datasets, suggesting an optimization for code-related tasks, particularly those involving GPU architectures. This model is designed for applications requiring a compact yet capable code-focused LLM.

Loading preview...