CharlesLi/llama_2_rlhf_safe_4o_reflect_500_full
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 13, 2025License:llama2Architecture:Transformer Open Weights Cold
The CharlesLi/llama_2_rlhf_safe_4o_reflect_500_full model is a 7 billion parameter Llama 2-based causal language model, fine-tuned from Meta's Llama-2-7b-chat-hf. This model has undergone additional fine-tuning on a generator dataset, achieving a loss of 1.2095 on its evaluation set. It is designed for conversational AI applications, leveraging its Llama 2 foundation with specific RLHF safety and reflection optimizations.
Loading preview...