mit-han-lab/Llama-3-8B-Instruct-QServe-g128

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kTool Calling:SupportedLicense:llama3Architecture:Transformer0.0K Cold

Loading preview...