R1pathak/TinyLlama_v1.1_int8_0.0

TEXT GENERATION
Concurrency Cost: 1
Model Size: 1.1B
Quant: BF16
Ctx Length: 2k
Architecture: Transformer
Cold
