TarhanE/sft-count_loss-Qwen3-0.6B-mle0.5-ul0.5-tox1.0-e4
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:Jun 8, 2025Architecture:Transformer Warm

Loading preview...