VMXVMX/llama2-project
Text Generation | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 4k | Architecture: Transformer | Cold

VMXVMX/llama2-project is a 7-billion-parameter language model based on the Llama 2 architecture. Developed by VMXVMX, it was fine-tuned using PEFT (parameter-efficient fine-tuning). It is intended for general language understanding and generation tasks and offers a 4096-token context window.
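As a rough illustration of what these figures imply, the sketch below estimates the weight storage at FP8 and the room left for generation inside the 4096-token window. It assumes the nominal "7B" count is exactly 7 billion and that FP8 uses one byte per weight. It ignores the KV cache, activations, and runtime overhead. The function names are hypothetical and are not part of any published API for this model.

```python
# Figures taken from the model listing; everything else is an illustrative assumption.
PARAMS = 7_000_000_000      # nominal "7B" parameter count (assumed exact here)
BYTES_PER_PARAM_FP8 = 1     # FP8 stores each weight in 8 bits
CONTEXT_LENGTH = 4096       # context window, in tokens


def weight_memory_gb(params: int = PARAMS,
                     bytes_per_param: int = BYTES_PER_PARAM_FP8) -> float:
    """Approximate weight storage in decimal gigabytes (weights only)."""
    return params * bytes_per_param / 1e9


def max_new_tokens(prompt_tokens: int,
                   context_length: int = CONTEXT_LENGTH) -> int:
    """Tokens left for generation once the prompt occupies part of the window."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(context_length - prompt_tokens, 0)


if __name__ == "__main__":
    print(weight_memory_gb())     # 7.0
    print(max_new_tokens(3000))   # 1096
```

Under these assumptions the weights alone take about 7 GB, and a 3000-token prompt leaves 1096 tokens for the reply. Actual memory use will be higher once the KV cache and runtime buffers are included.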
