CHIH-HUNG/llama-2-13b-FINETUNE1_17w-gate_up_down_proj
TEXT GENERATIONConcurrency Cost:1Model Size:13BQuant:FP8Ctx Length:4kPublished:Sep 3, 2023License:llama2Architecture:Transformer Open Weights Cold

The CHIH-HUNG/llama-2-13b-FINETUNE1_17w-gate_up_down_proj model is a 13 billion parameter Llama-2-based language model fine-tuned by CHIH-HUNG using the huangyt/FINETUNE1 dataset, comprising approximately 170,000 data points. This LoRA-tuned model specifically targets the gate_proj, up_proj, and down_proj attention layers. It demonstrates improved performance over the base Llama-2-13b model on benchmarks like HellaSwag, MMLU, and TruthfulQA, making it suitable for general language understanding and generation tasks.

Loading preview...