robinliubin/h2o-llama2-7b-4bits
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kArchitecture:Transformer Cold

The robinliubin/h2o-llama2-7b-4bits model is a 7 billion parameter Llama 2-based causal language model, fine-tuned using H2O LLM Studio. It is optimized for efficient deployment with 4-bit quantization, making it suitable for text generation tasks on resource-constrained hardware. This model leverages the h2oai/h2ogpt-4096-llama2-7b as its base, offering a 4096-token context window.

Loading preview...