CharlesLi/llama_2_rlhf_safe_llama_3_8B_reflect_500_full
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 13, 2025License:llama2Architecture:Transformer Open Weights Cold

The CharlesLi/llama_2_rlhf_safe_llama_3_8B_reflect_500_full model is a 7 billion parameter language model fine-tuned from Meta's Llama-2-7b-chat-hf. This model was trained with a focus on safety and reflection, utilizing RLHF techniques. It is intended for applications requiring a Llama-2-based model with enhanced safety characteristics, demonstrating a training loss of 0.8959.

Loading preview...