CharlesLi/llama_2_cot_simplest_alpaca_4_3_epoch_full
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 21, 2025License:llama2Architecture:Transformer Open Weights Cold
CharlesLi/llama_2_cot_simplest_alpaca_4_3_epoch_full is a 7 billion parameter language model fine-tuned from Meta's Llama-2-7b-chat-hf. This model was trained for 3 epochs on a generator dataset, achieving a validation loss of 1.0590. It is intended for general conversational tasks, leveraging the Llama 2 architecture with a 4096 token context length.
Loading preview...