AL49/Llama-2-7b-chat-hf-NoAccelerate-sharded-bf16-2GB

Text Generation
Concurrency Cost: 1
Model Size: 7B
Quant: FP8
Ctx Length: 4k
Architecture: Transformer
Cold
