CharlesLi/llama_3_gsm8k_llama_2
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Dec 31, 2024License:llama3.1Architecture:Transformer Cold

The CharlesLi/llama_3_gsm8k_llama_2 is an 8 billion parameter language model, fine-tuned from Meta's Llama-3.1-8B-Instruct. This model is optimized for specific tasks, demonstrating a final validation loss of 0.6028 after 30 training steps. It is intended for applications requiring a Llama-3.1-8B-Instruct base model with further specialized training.

Loading preview...