Maxtra/llama-2-7b-frestival
Text Generation | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 4k | Architecture: Transformer | Cold

Maxtra/llama-2-7b-frestival is a language model based on Llama-2-7b, developed by Maxtra. It was trained with 4-bit quantization, using the nf4 quantization type and a float16 compute dtype, and fine-tuned with PEFT 0.4.0. It is intended for use cases that call for a Llama-2-7b base model with this particular quantization configuration.
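
The quantization settings named above can be written down as a configuration object. The following is a minimal sketch, assuming the Hugging Face transformers, torch, and bitsandbytes libraries are the ones in use; the description does not name them, and the names `QUANT_SETTINGS` and `build_quant_config` are hypothetical, introduced only for illustration.

```python
# The three settings stated in the description: 4-bit, nf4, float16 compute dtype.
QUANT_SETTINGS = {
    "load_in_4bit": True,
    "bnb_4bit_quant_type": "nf4",
    "bnb_4bit_compute_dtype": "float16",
}


def build_quant_config():
    """Build a transformers BitsAndBytesConfig from QUANT_SETTINGS.

    Assumption: transformers, torch, and bitsandbytes are installed.
    The imports are deferred so the settings can be inspected without them.
    """
    import torch
    from transformers import BitsAndBytesConfig

    return BitsAndBytesConfig(
        load_in_4bit=QUANT_SETTINGS["load_in_4bit"],
        bnb_4bit_quant_type=QUANT_SETTINGS["bnb_4bit_quant_type"],
        # Map the dtype name ("float16") to the torch dtype object.
        bnb_4bit_compute_dtype=getattr(torch, QUANT_SETTINGS["bnb_4bit_compute_dtype"]),
    )
```

With those libraries, the returned object would typically be passed as `quantization_config=` to `AutoModelForCausalLM.from_pretrained`. Whether this repository holds merged weights or only a PEFT adapter is not stated here, so the loading step itself is left out.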
