jiinking/2_layer_GQA2_llama_model

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kArchitecture:Transformer Warm

Loading preview...