sminchoi/Llama-2-13b-chat-hf_guanaco-llama2-1k_230914_A6000
TEXT GENERATIONConcurrency Cost:1Model Size:13BQuant:FP8Ctx Length:4kArchitecture:Transformer Cold

The sminchoi/Llama-2-13b-chat-hf_guanaco-llama2-1k_230914_A6000 model is a 13 billion parameter language model based on the Llama-2 architecture. It was fine-tuned using 4-bit quantization with the bitsandbytes library, specifically employing the nf4 quantization type. This model is optimized for chat-based applications, leveraging its Llama-2 foundation for conversational tasks. Its training methodology focuses on efficient resource utilization through quantization.

Loading preview...