imone/deprecated_bf16_LLaMA2_13B_with_EOT_token
TEXT GENERATIONConcurrency Cost:1Model Size:13BQuant:FP8Ctx Length:4kLicense:llama2Architecture:Transformer0.0K Open Weights Cold

The imone/deprecated_bf16_LLaMA2_13B_with_EOT_token is a 13 billion parameter Llama 2 model, modified to include an End-of-Turn (EOT) token and a PAD token. These additions, at IDs 32000 and 32001 respectively, are initialized with the mean of existing token embeddings. This model is specifically adapted for tasks requiring explicit turn demarcation, enhancing conversational AI applications.

Loading preview...