mlfoundations-dev/ot3_300k_ckpt-epoch4

TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kArchitecture:Transformer Cold

Loading preview...