thusinh1969/llama-3.1-8B-pretrain-test-rank128-1.3B-params

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kArchitecture:Transformer Warm

Loading preview...