kmseong/llama3.2_3b_new_SSFT_lr3e-5_nowramupratio
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Apr 4, 2026License:llama3.2Architecture:Transformer Loading

The kmseong/llama3.2_3b_new_SSFT_lr3e-5_nowramupratio model is a 3.2 billion parameter Llama 3.2-based instruction-tuned language model developed by kmseong. It has undergone Phase 0 of Safety-WaRP (Weight space Rotation Process) using the Circuit Breakers dataset, focusing on base safety training. This model is specifically designed to provide safe responses by mitigating harmful outputs, serving as a foundational safety layer for further development. It is optimized for establishing safety mechanisms, though its general utility may be reduced at this stage.

Loading preview...