ahmadhasan/deepseek-r1-sft
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Apr 6, 2026License:apache-2.0Architecture:Transformer Open Weights Cold
The ahmadhasan/deepseek-r1-sft is a 7.6 billion parameter Qwen2-based causal language model, fine-tuned by ahmadhasan. This model was trained using Unsloth and Huggingface's TRL library, enabling faster training. It is designed for general language generation tasks, leveraging its Qwen2 architecture for robust performance.
Loading preview...