bobofrut/ladybird-base-7B-v8
Task: Text Generation
Concurrency Cost: 1
Model Size: 7B
Quant: FP8
Ctx Length: 4k
Published: Mar 23, 2024
License: apache-2.0
Architecture: Transformer
Open Weights
Ladybird-base-7B-v8 is a 7-billion-parameter large language model developed by bobofrut. It is built on the Mistral architecture and has a 4096-token context length. It uses Grouped-Query Attention and Sliding-Window Attention to reduce the cost of attention, and a byte-fallback BPE tokenizer to broaden language coverage. The model is designed for complex language understanding and generation tasks, with strong reported performance on benchmarks including Winogrande, TruthfulQA, and GSM8K.
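The two attention mechanisms named above can be illustrated with a small sketch. This is not code from the model; it only shows the general idea, and the function names, the tiny window size, and the head counts are assumptions chosen for readability. With sliding-window attention, each position attends only to itself and a fixed number of preceding positions. With grouped-query attention, several query heads share one key/value head.

```python
def sliding_window_mask(seq_len, window):
    """Return mask[i][j] = True when query position i may attend to key position j.

    A position sees itself and the (window - 1) positions before it,
    and never a later position (causal).
    """
    return [
        [0 <= i - j < window for j in range(seq_len)]
        for i in range(seq_len)
    ]


def kv_head_for_query_head(q_head, num_q_heads, num_kv_heads):
    """Map a query head index to the key/value head it shares under grouped-query attention."""
    if num_q_heads % num_kv_heads != 0:
        raise ValueError("num_q_heads must be a multiple of num_kv_heads")
    group_size = num_q_heads // num_kv_heads
    return q_head // group_size


if __name__ == "__main__":
    # Illustrative values only: a 6-token sequence with a window of 3.
    for row in sliding_window_mask(seq_len=6, window=3):
        print("".join("x" if allowed else "." for allowed in row))

    # Illustrative head counts (assumed, not read from this model's config).
    print(kv_head_for_query_head(q_head=5, num_q_heads=32, num_kv_heads=8))
```

In the printed mask, each row is a query position and each `x` marks a key position it may attend to, so the band of `x` marks never grows wider than the window. The actual window size and head counts for this model would come from its configuration file, which is not shown here.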