masani/SFT_modgsm8k_Llama-3.2-1B_epoch_1_global_step_25

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kArchitecture:Transformer Warm

Loading preview...