martyn/llama2-megamerge-dare-13b-v2
Text Generation | Concurrency Cost: 1 | Model Size: 13B | Quant: FP8 | Ctx Length: 4k | Published: Dec 17, 2023 | License: llama2 | Architecture: Transformer | Open Weights | Cold

martyn/llama2-megamerge-dare-13b-v2 is a 13-billion-parameter language model based on the Llama-2 architecture, created by martyn. It is a DARE merge of 17 different Llama-2 13B models, including models focused on code, mathematics, and instruction following, and is intended to generalize across instruct styles. With a 4096-token context length, it is designed for diverse conversational and task-oriented applications.
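For readers unfamiliar with DARE (drop and rescale), the sketch below illustrates the general idea on flat lists of floats: take each fine-tuned model's difference from the base weights, randomly drop most of those differences, rescale the survivors by 1 / (1 - drop_rate), and add the weighted result back onto the base. This is only a toy illustration. The function names `dare_delta` and `dare_merge`, the drop rate of 0.9, and the equal per-model weights are assumptions made for the example, not the recipe used to build this model, which the listing does not state.

```python
import random


def dare_delta(base, finetuned, drop_rate, rng):
    """Return the DARE-processed delta (finetuned - base) for one model.

    Each delta entry is dropped (set to 0) with probability drop_rate;
    surviving entries are rescaled by 1 / (1 - drop_rate).
    """
    if not 0.0 <= drop_rate < 1.0:
        raise ValueError("drop_rate must be in [0, 1)")
    scale = 1.0 / (1.0 - drop_rate)
    out = []
    for b, f in zip(base, finetuned):
        delta = f - b
        keep = rng.random() >= drop_rate
        out.append(delta * scale if keep else 0.0)
    return out


def dare_merge(base, finetuned_models, drop_rate=0.9, weights=None, seed=0):
    """Merge several fine-tuned weight vectors onto a shared base.

    weights defaults to an equal share per model (an assumption of this
    sketch); drop_rate=0.9 is likewise only an illustrative default.
    """
    rng = random.Random(seed)
    if weights is None:
        weights = [1.0 / len(finetuned_models)] * len(finetuned_models)
    merged = list(base)
    for w, ft in zip(weights, finetuned_models):
        for i, d in enumerate(dare_delta(base, ft, drop_rate, rng)):
            merged[i] += w * d
    return merged


if __name__ == "__main__":
    base = [0.0, 1.0, 2.0, 3.0]
    code_model = [0.5, 1.0, 2.5, 3.0]   # hypothetical fine-tune
    math_model = [0.0, 1.5, 2.0, 3.5]   # hypothetical fine-tune
    print(dare_merge(base, [code_model, math_model], drop_rate=0.5))
```

In a real merge the same steps would be applied tensor by tensor across each checkpoint's parameters rather than to short Python lists.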
