CharlesLi/llama_2_sky_safe_o1_llama_3_8B_reflect_4000_1000_full
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 13, 2025License:llama2Architecture:Transformer Open Weights Cold

CharlesLi/llama_2_sky_safe_o1_llama_3_8B_reflect_4000_1000_full is a 7 billion parameter language model, fine-tuned from Meta's Llama-2-7b-chat-hf. This model was trained with a learning rate of 2e-05 and a cosine scheduler over one epoch. It achieved a validation loss of 0.6644, indicating its performance on the evaluation set.

Loading preview...