mlfoundations-dev/s1K_llama3.1_8b_32kcontext

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kLicense:llama3.1Architecture:Transformer Cold

Loading preview...