cretone/ultron_storm_sft_20231210
TEXT GENERATIONConcurrency Cost:1Model Size:1.1BQuant:BF16Ctx Length:2kArchitecture:Transformer Gated Cold
Ultron_storm_sft_20231210 is a 1.1 billion parameter large language model from the Ultron series, developed by cretone. It utilizes Grouped Query Attention and has a sequence length of 2048 tokens. Trained on a substantial 950 billion token dataset, this model is part of a larger family of LLMs ranging from 160M to 1.1B parameters.
Loading preview...