kmseong/llama3.2_3b_new_SSFT_lr2e-5
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Apr 4, 2026License:llama3.2Architecture:Transformer Loading

The kmseong/llama3.2_3b_new_SSFT_lr2e-5 is a 3.2 billion parameter Llama 3.2-based instruction-tuned model developed by kmseong. This model has undergone Phase 0 of Safety-WaRP (Weight space Rotation Process) using the Circuit Breakers dataset, focusing on base safety training. It is designed to provide safe responses, particularly in handling harmful prompts, though its general utility may be reduced at this stage. The model has a context length of 32768 tokens.

Loading preview...