mit-han-lab/Llama-3-8B-Instruct-QServe-g128
Text Generation
Concurrency Cost: 1
Model Size: 8B
Quant: FP8
Ctx Length: 8k
License: llama3
Architecture: Transformer
0.0K Cold
