violetxi/sft_tir_rl_prep_Llama_lr0.0001_bs64_wd0.0_wp0.1_checkpoint-epoch1

TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kArchitecture:Transformer Cold

Loading preview...