mit-han-lab/Llama-3-8B-QServe
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kLicense:llama3Architecture:Transformer0.0K Cold

Loading preview...