avemio-digital/Nvidia_vicuna-13B-1.1-HF-sharded

Text Generation
Concurrency Cost: 1
Model Size: 13B
Quant: FP8
Ctx Length: 4k
Architecture: Transformer
Cold
