Deathsquad10/TinyLlama-repeat
TEXT GENERATIONConcurrency Cost:1Model Size:1.1BQuant:BF16Ctx Length:2kPublished:Jan 6, 2024License:apache-2.0Architecture:Transformer Open Weights Warm

Deathsquad10/TinyLlama-repeat is a 1.1 billion parameter Llama-architecture model, fine-tuned for chat applications. It adopts the same architecture and tokenizer as Llama 2, making it compatible with existing Llama-based open-source projects. This compact model is optimized for applications requiring restricted computation and memory footprints, excelling in conversational tasks.

Loading preview...