mit-han-lab/Llama-3-8B-QServe

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kTool Calling:SupportedLicense:llama3Architecture:Transformer0.0K Cold

Loading preview...