minchaoh2002/Qwen3-14B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-3-epoch_step_12

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:14BQuant:FP8Ctx Length:32kPublished:May 8, 2026Architecture:Transformer Warm

Loading preview...