CharlesLi/llama_2_cot_simplest_code_math_1_3_epoch_full
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 21, 2025License:llama2Architecture:Transformer Open Weights Cold

The CharlesLi/llama_2_cot_simplest_code_math_1_3_epoch_full is a 7 billion parameter Llama-2-7b-chat-hf model fine-tuned by CharlesLi. This model is specifically trained on a generator dataset, achieving a loss of 0.6809 on its evaluation set. It is intended for tasks related to code and mathematics, leveraging its Llama 2 base for conversational and reasoning capabilities.

Loading preview...