DJCheng/LLaMA3.2-1B-Instruct-Latent-SFT-Top10

TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:May 11, 2026Architecture:Transformer Cold

Loading preview...