jan-hq/supermario-slerp-v2
Text Generation · Concurrency Cost: 1 · Model Size: 7B · Quant: FP8 · Ctx Length: 4k · Published: Dec 12, 2023 · License: apache-2.0 · Architecture: Transformer
jan-hq/supermario-slerp-v2 is a 7-billion-parameter language model from Jan, created by merging v1olet_marcoroni-go-bruins-merge-7B and juanako-7b-UNA with the SLERP (spherical linear interpolation) merge method. The model is a test project for exploring model merging techniques. It achieves an average score of 71.35 on the Open LLM Leaderboard, demonstrating capabilities across a range of reasoning and language-understanding tasks within a 4096-token context window.
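For readers unfamiliar with the technique: SLERP merging interpolates each pair of matching weight tensors along the arc between them on the unit hypersphere rather than along a straight line, which tends to preserve the geometry of both parents better than plain averaging. The snippet below is a minimal, hypothetical PyTorch sketch of that per-tensor interpolation step; it is an illustration of the general idea, not the exact recipe or tooling used to produce this model, and the `slerp` helper and usage names are assumptions for the example.

```python
import torch

def slerp(t: float, v0: torch.Tensor, v1: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Spherical linear interpolation between two matching weight tensors.

    t is the interpolation factor: 0.0 returns v0, 1.0 returns v1.
    Falls back to linear interpolation when the tensors are nearly colinear.
    """
    v0_flat, v1_flat = v0.flatten().float(), v1.flatten().float()
    v0_n = v0_flat / (v0_flat.norm() + eps)
    v1_n = v1_flat / (v1_flat.norm() + eps)
    dot = torch.clamp(torch.dot(v0_n, v1_n), -1.0, 1.0)
    if abs(dot.item()) > 0.9995:
        # Nearly parallel vectors: plain lerp is numerically safer here.
        merged = (1.0 - t) * v0_flat + t * v1_flat
    else:
        theta = torch.arccos(dot)            # angle between the two weight vectors
        sin_theta = torch.sin(theta)
        w0 = torch.sin((1.0 - t) * theta) / sin_theta
        w1 = torch.sin(t * theta) / sin_theta
        merged = w0 * v0_flat + w1 * v1_flat
    return merged.reshape(v0.shape).to(v0.dtype)

# Hypothetical usage: merge one matching parameter from the two parent checkpoints.
# merged_weight = slerp(0.5, state_dict_a[name], state_dict_b[name])
```

In practice a merge like this is applied parameter-by-parameter across both parents' state dicts, often with a different interpolation factor per layer group.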