soulteary/Chinese-Llama-2-7b-4bit
Text Generation | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 4k | Published: Jul 22, 2023 | License: llama2 | Architecture: Transformer | Open Weights | Cold

soulteary/Chinese-Llama-2-7b-4bit is a 7-billion-parameter model based on Llama 2, developed by soulteary and optimized for Chinese language processing. It is a 4-bit quantized version of the Chinese LLaMA2 7B project. Quantization shrinks the memory needed to hold the weights, which makes the model suited to efficient deployment and inference on Chinese natural language understanding and generation tasks.
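To give a rough sense of why 4-bit quantization helps deployment, the sketch below estimates the memory taken by the weights alone at several precisions. The helper `weight_memory_gib` and the constant `PARAMS_7B` are hypothetical names introduced for illustration, and the nominal count of 7e9 parameters is an assumption, not the model's exact size. The estimate ignores quantization metadata (scales and zero-points), activations, and the KV cache, so real usage will be somewhat higher.

```python
def weight_memory_gib(num_params: float, bits_per_weight: int) -> float:
    """Approximate memory for model weights alone, in GiB.

    Assumes every parameter is stored at the same bit width and
    ignores any per-group quantization metadata.
    """
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / (1024 ** 3)


# Nominal parameter count for a "7B" model (assumption for illustration).
PARAMS_7B = 7e9

# Compare weight storage at common precisions.
for label, bits in [("FP16", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label:>5}: {weight_memory_gib(PARAMS_7B, bits):.2f} GiB")
```

Under these assumptions the 4-bit weights need about a quarter of the FP16 footprint, which is what can bring a 7B model within reach of a single consumer GPU.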
