dat-lequoc/vLLM-fast-apply-16bit-v0.13-Llama3.2-1B

Hugging Face
TEXT GENERATION
Concurrency Cost: 1
Model Size: 1B
Quant: BF16
Ctx Length: 32k
License: apache-2.0
Architecture: Transformer
Open Weights
Warm
