narabzad/train_s1k_queries_on_math_data_test_template2.deepseek_all_full-checkpoint-625
Text Generation
Concurrency Cost: 2
Model Size: 32.8B
Quant: FP8
Ctx Length: 32k
Published: Jan 27, 2026
Architecture: Transformer
Cold
