CharlesLi/llama_2_sky_o1_4_full
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 13, 2025License:llama2Architecture:Transformer Open Weights Cold
The CharlesLi/llama_2_sky_o1_4_full is a 7 billion parameter language model, fine-tuned from Meta's Llama-2-7b-chat-hf. This model was trained on a generator dataset, achieving a validation loss of 0.6753. It is intended for tasks requiring a fine-tuned Llama 2 base, with a context length of 4096 tokens.
Loading preview...