DrishtiSharma/llama-2-7b-int4-alpaca-normal-attention-tp-1-merged

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kArchitecture:Transformer Warm

Loading preview...