nm-testing/Qwen2-1.5B-Instruct-FP8-K-V

TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kArchitecture:Transformer Cold

Loading preview...