kmseong/llama3.2_3b_new_SSFT_lr5e-5
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Apr 6, 2026License:llama3.2Architecture:Transformer Loading

The kmseong/llama3.2_3b_new_SSFT_lr5e-5 model is a 3.2 billion parameter Llama 3.2-based language model developed by kmseong. It has undergone Phase 0 of Safety-WaRP (Weight space Rotation Process) training, specifically fine-tuned on the Circuit Breakers dataset to enhance safety responses. This model is designed to establish foundational safety mechanisms, making it suitable as a base for further safety and utility enhancements in the WaRP pipeline.

Loading preview...