multilingual-pruning/pruned-pruned-llama3-8b-instruct-wanda-0.5-unstructured-mc4-de-42

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kArchitecture:Transformer Cold

Loading preview...