ruwan/open-llama-sharded-3GB-7B-alpaca-vmware
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kLicense:apache-2.0Architecture:Transformer Open Weights Cold

The ruwan/open-llama-sharded-3GB-7B-alpaca-vmware model is a 7 billion parameter language model based on the Open Llama architecture, fine-tuned with Alpaca data. This model is sharded for efficient deployment, utilizing the original Open Llama tokenizer. It is designed for general-purpose language generation and understanding tasks, offering a balance of performance and resource efficiency.

Loading preview...