ank028/Llama-3.2-1B-Instruct-commonsense_qa-MGSM8K-sft1-slerp

TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kArchitecture:Transformer Cold

Loading preview...