masani/SFT_gsm8k-t2_Llama-3.2-1B_epoch_1_global_step_15

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kArchitecture:Transformer Warm

Loading preview...