CHIH-HUNG/llama-2-13b-FINETUNE2_3w-gate_up_down_proj
TEXT GENERATIONConcurrency Cost:1Model Size:13BQuant:FP8Ctx Length:4kPublished:Sep 1, 2023License:llama2Architecture:Transformer Open Weights Cold

CHIH-HUNG/llama-2-13b-FINETUNE2_3w-gate_up_down_proj is a 13 billion parameter Llama-2-based language model fine-tuned by CHIH-HUNG using the huangyt/FINETUNE2 dataset, comprising approximately 30,000 data points. This model specifically targets the 'gate_proj', 'up_proj', and 'down_proj' attention layers for LoRA fine-tuning. It demonstrates improved performance on the MMLU and TruthfulQA benchmarks compared to the base Llama-2-13b model, making it suitable for tasks requiring enhanced reasoning and factual accuracy.

Loading preview...