murugeshmarvel/QAD-llama3.1-8B-iter4-fft

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kArchitecture:Transformer Cold

Loading preview...