YousefAshraf/deepseek-r1-distill-llama-8b-merged

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kArchitecture:Transformer Warm

Loading preview...