Syed-Hasan-8503/Llama-3-8B-Instruct-262k-Qserve

TEXT GENERATION
Concurrency Cost: 1
Model Size: 8B
Quant: FP8
Ctx Length: 8k
Tool Calling: Supported
Architecture: Transformer
Cold
