narabzad/train-s1-decontam-deepseek-checkpoint-625
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Jan 9, 2026Architecture:Transformer Cold

Loading preview...