avemio-digital/Nvidia_CodeLlama-7B-Instruct-bf16-sharded

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kArchitecture:Transformer Cold

Loading preview...