gardner/TinyLlama-1.1B-Instruct-3T
Text Generation | Concurrency Cost: 1 | Model Size: 1.1B | Quant: BF16 | Ctx Length: 2k | Published: Jan 20, 2024 | License: apache-2.0 | Architecture: Transformer | Open Weights | Warm

gardner/TinyLlama-1.1B-Instruct-3T is a 1.1-billion-parameter, Llama-derived, instruction-tuned model built on the TinyLlama intermediate step 1431k-3T base model. It was fine-tuned on the OpenHermes instruct dataset for four epochs and is intended as a starting point for further fine-tuning on instruction-following tasks. Its small size makes it a compact base for developing specialized conversational AI applications.
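
As a rough illustration of what "compact" means here, the sketch below estimates the memory needed for the weights alone, using the parameter count (1.1B) and precision (BF16, 2 bytes per value) listed for this model. The function name `weight_memory_bytes` is hypothetical, and the figure excludes activations, the KV cache, and framework overhead, so real usage will be somewhat higher.

```python
# Rough weight-memory estimate from the listed model size and precision.
# Assumption: exactly 1.1e9 parameters; the true count may differ slightly.
PARAMS = 1.1e9        # "Model Size: 1.1B"
BYTES_PER_PARAM = 2   # BF16 stores each value in 16 bits = 2 bytes


def weight_memory_bytes(params: float = PARAMS,
                        bytes_per_param: int = BYTES_PER_PARAM) -> float:
    """Return the memory for the weights only (no activations or KV cache)."""
    return params * bytes_per_param


total = weight_memory_bytes()
print(f"{total / 1e9:.2f} GB")     # prints "2.20 GB"  (decimal gigabytes)
print(f"{total / 2**30:.2f} GiB")  # prints "2.05 GiB" (binary gibibytes)
```

At roughly 2.2 GB of weights, the model should fit on most consumer GPUs, which is consistent with its stated role as a base for further fine-tuning.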
