amd/SAND-MathScience-DeepSeek-Qwen32B
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kLicense:otherArchitecture:Transformer0.0K Cold

amd/SAND-MathScience-DeepSeek-Qwen32B is a 32.8 billion parameter reasoning model developed by AMD, fine-tuned from DeepSeek-R1-Distill-Qwen-32B. It excels in mathematical and scientific reasoning tasks, achieving performance comparable to or surpassing next-generation models like Qwen3-32B on benchmarks such as AIME and GPQA. This model was built using a novel synthetic data pipeline on AMD ROCm™ stack and AMD Instinct™ MI325 GPUs, prioritizing data difficulty and novelty over volume.

Loading preview...