CharlesLi/llama_2_sky_safe_o1_llama_3_8B_default_1000_500_full
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 13, 2025License:llama2Architecture:Transformer Open Weights Cold

The CharlesLi/llama_2_sky_safe_o1_llama_3_8B_default_1000_500_full model is a 7 billion parameter language model fine-tuned from Meta's Llama-2-7b-chat-hf. This model was fine-tuned on a generator dataset, achieving a loss of 0.7590 on the evaluation set. It is intended for generative tasks, building upon the Llama 2 architecture with a 4096 token context length.

Loading preview...