CHIH-HUNG/llama-2-13b-FINETUNE2_3w-q_k_v_o_proj
TEXT GENERATION | Concurrency Cost: 1 | Model Size: 13B | Quant: FP8 | Ctx Length: 4k | Published: Sep 2, 2023 | License: llama2 | Architecture: Transformer | Open Weights | Cold

CHIH-HUNG/llama-2-13b-FINETUNE2_3w-q_k_v_o_proj is a 13-billion-parameter Llama-2-based language model fine-tuned by CHIH-HUNG on the huangyt/FINETUNE2 dataset, which comprises approximately 30,000 entries. Fine-tuning used LoRA applied to the q_proj, k_proj, v_proj, and o_proj attention projection layers. The model has a context length of 4096 tokens and performs competitively with the base Llama-2-13b model on the ARC, HellaSwag, MMLU, and TruthfulQA benchmarks.
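
To give a rough sense of what applying LoRA to the q_proj, k_proj, v_proj, and o_proj layers means in terms of trainable parameters, the sketch below counts the adapter weights for a Llama-2-13b-shaped model. The hidden size (5120) and layer count (40) follow the public Llama-2-13b configuration; the LoRA rank is not stated on this page, so `ASSUMED_RANK = 8` is purely a hypothetical value for illustration, as is the helper name `lora_param_count`.

```python
# Rough count of LoRA adapter parameters when targeting the four
# attention projections of a Llama-2-13b-shaped model.
# Assumption: the rank below is hypothetical; the page does not state it.

HIDDEN_SIZE = 5120   # Llama-2-13b hidden size
NUM_LAYERS = 40      # Llama-2-13b transformer layers
TARGET_MODULES = ("q_proj", "k_proj", "v_proj", "o_proj")
ASSUMED_RANK = 8     # hypothetical LoRA rank, for illustration only


def lora_param_count(rank, hidden_size=HIDDEN_SIZE,
                     num_layers=NUM_LAYERS,
                     target_modules=TARGET_MODULES):
    """Return the number of trainable LoRA parameters.

    Each targeted projection is a hidden_size x hidden_size matrix, so
    its adapter consists of A (rank x hidden_size) and
    B (hidden_size x rank), i.e. rank * 2 * hidden_size weights.
    """
    per_module = rank * (hidden_size + hidden_size)
    return per_module * len(target_modules) * num_layers


if __name__ == "__main__":
    trainable = lora_param_count(ASSUMED_RANK)
    print(f"LoRA parameters at rank {ASSUMED_RANK}: {trainable:,}")
```

Under these assumptions the adapters amount to roughly 13 million weights, about 0.1% of the 13 billion base parameters. The count scales linearly with the rank, so a different rank changes the figure proportionally.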
