ahczhg/Llama-3.2-1B-Aegis-SFT-DPO
TEXT GENERATIONConcurrency Cost:1Model Size:1BQuant:BF16Ctx Length:32kPublished:Nov 15, 2025License:llama3.2Architecture:Transformer0.0K Loading
ahczhg/Llama-3.2-1B-Aegis-SFT-DPO is a 1.23 billion parameter Llama 3.2 model fine-tuned by ahczhg for content-safe instruction following. Utilizing a two-stage Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) approach on the NVIDIA Aegis AI Content Safety Dataset 2.0, this model is optimized to generate responsible and aligned responses. It excels in educational tools, content safety research, and prototype development requiring safety-aware text generation.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–