jiinking/1_layer_GQA4_llama_model

Hugging Face
Text Generation
Concurrency Cost: 1
Model Size: 1B
Quant: BF16
Ctx Length: 32k
Tool Calling: Supported
Architecture: Transformer
Warm
