CharlesLi/llama_2_rlhf_safe_4o_default_100_full
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 13, 2025License:llama2Architecture:Transformer Open Weights Cold

The CharlesLi/llama_2_rlhf_safe_4o_default_100_full model is a 7 billion parameter Llama-2-7b-chat-hf variant, fine-tuned by CharlesLi for safety-aligned chat applications. This model leverages Reinforcement Learning from Human Feedback (RLHF) to enhance its conversational safety and adherence to desired behavioral norms. It is specifically designed for use cases requiring a robust and safety-conscious conversational AI.

Loading preview...