alvinming/es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step320

TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kArchitecture:Transformer Cold

Loading preview...