kavinilavan/Llama-2-13b-chat-hf-array_4bit_new_prompt

TEXT GENERATIONConcurrency Cost:1Model Size:13BQuant:FP8Ctx Length:4kArchitecture:Transformer Cold

Loading preview...