multilingual-pruning/pruned-pruned-llama3-8b-instruct-wanda-0.5-unstructured-mc4-de-42
Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 8k | Architecture: Transformer | Cold
