vanillaOVO/supermario_v3
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 31, 2024License:apache-2.0Architecture:Transformer Open Weights Cold

vanillaOVO/supermario_v3 is a 7 billion parameter language model created by vanillaOVO, based on a merge of pre-trained models using the DARE method. This model is built upon the Mistral architecture and is designed for causal language modeling tasks. Its primary differentiator lies in its construction via model merging techniques, aiming to combine strengths from various base models. It supports a context length of 4096 tokens.

Loading preview...