PruningVSQuantization/Llama-3.2-1B-Instruct-awq-bits8-seed0

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kArchitecture:Transformer Warm

Loading preview...