sminchoi/Llama-2-13b-chat-hf_guanaco-llama2-1k_230914_A40
TEXT GENERATIONConcurrency Cost:1Model Size:13BQuant:FP8Ctx Length:4kArchitecture:Transformer Cold

The sminchoi/Llama-2-13b-chat-hf_guanaco-llama2-1k_230914_A40 model is a 13 billion parameter language model based on the Llama-2-13b-chat-hf architecture, fine-tuned by sminchoi. It was trained using 4-bit quantization with the bitsandbytes library, specifically employing the nf4 quantization type. This model is designed for chat-based applications, leveraging its Llama-2 foundation and fine-tuning process to generate conversational responses.

Loading preview...