CharlesLi/llama_2_rlhf_safe_llama_3_8B_reflect_1000_full
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 13, 2025License:llama2Architecture:Transformer Open Weights Cold

The CharlesLi/llama_2_rlhf_safe_llama_3_8B_reflect_1000_full model is a 7 billion parameter Llama 2 based language model, fine-tuned from meta-llama/Llama-2-7b-chat-hf. This model was trained using a reflection dataset, aiming to enhance safety and alignment through Reinforcement Learning from Human Feedback (RLHF) principles. It is designed for general language generation tasks where safety and adherence to RLHF objectives are prioritized.

Loading preview...