CharlesLi/llama_2_sky_safe_o1_llama_3_8B_default_4000_500_full
Text Generation | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 4k | Published: Jan 13, 2025 | License: llama2 | Architecture: Transformer | Open Weights | Cold

CharlesLi/llama_2_sky_safe_o1_llama_3_8B_default_4000_500_full is a 7-billion-parameter model fine-tuned from Llama-2-7b-chat-hf on a generator dataset. It reports a validation loss of 0.6327 and is best suited to tasks that resemble its training data. It supports a context length of 4096 tokens, so it fits applications that need a specialized Llama-2-based language model within that window.
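To put the two reported numbers in context, the sketch below converts the validation loss into a perplexity and computes how many tokens remain for generation after a prompt. It assumes the loss is a mean per-token cross-entropy in nats, which the card does not state; the function names perplexity_from_loss and max_new_tokens are hypothetical helpers written for this illustration, not part of any library.

```python
import math

# Values reported on the model card.
VALIDATION_LOSS = 0.6327
CONTEXT_LENGTH = 4096


def perplexity_from_loss(loss: float) -> float:
    """Convert a loss to perplexity.

    Assumption: the loss is the mean per-token cross-entropy in nats.
    """
    return math.exp(loss)


def max_new_tokens(prompt_tokens: int, context_length: int = CONTEXT_LENGTH) -> int:
    """Return how many tokens can still be generated after the prompt.

    The prompt and the generated text share one context window, so the
    budget is the window size minus the prompt length, floored at zero.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(context_length - prompt_tokens, 0)


if __name__ == "__main__":
    print(f"perplexity: {perplexity_from_loss(VALIDATION_LOSS):.3f}")
    print(f"tokens left after a 4000-token prompt: {max_new_tokens(4000)}")
```

Under that assumption, a loss of 0.6327 corresponds to a perplexity of roughly 1.88. A prompt that already uses most of the 4096-token window leaves little room for output, so long inputs may need to be truncated or summarized first.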
