anmolagarwal999/llama_on_bigbench
Text Generation | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 4k | Architecture: Transformer | Cold

The anmolagarwal999/llama_on_bigbench model is a Llama-based language model developed by anmolagarwal999. It was trained with bitsandbytes 8-bit quantization, which reduces the memory footprint during training and inference. Training used PEFT version 0.6.0.dev0. The model's defining characteristic is this use of 8-bit quantization for efficient deployment and operation.
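To give a rough sense of the memory saving that 8-bit quantization implies, the sketch below estimates the storage needed for the weights alone at 16 and 8 bits per parameter. It assumes the nominal 7B parameter count from the listing (the exact count is not stated), and the helper name `weight_memory_gib` is hypothetical. It ignores activations, the KV cache, and any per-block quantization overhead, so real usage will be higher.

```python
def weight_memory_gib(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for the model weights alone, in GiB."""
    bytes_total = num_params * bits_per_param / 8
    return bytes_total / 2**30


# Assumption: "7B" from the listing is treated as exactly 7e9 parameters.
NUM_PARAMS = 7e9

for label, bits in [("16-bit", 16), ("8-bit", 8)]:
    print(f"{label}: ~{weight_memory_gib(NUM_PARAMS, bits):.1f} GiB")
# 16-bit: ~13.0 GiB
# 8-bit: ~6.5 GiB
```

Under these assumptions, storing the weights in 8 bits roughly halves their footprint compared with 16-bit storage.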
