jan-hq/supermario-slerp

Text Generation · Concurrency Cost: 1 · Model Size: 7B · Quant: FP8 · Ctx Length: 4k · Published: Dec 11, 2023 · License: apache-2.0 · Architecture: Transformer · Open Weights

jan-hq/supermario-slerp is a 7-billion-parameter language model created by jan-hq. It merges Seraph-7B and Marcoroni-7B-v3 using the Slerp method, with Mistral-7B-v0.1 as the base model, and was released as a test project for exploring model merging techniques. It achieves an average score of 72.32 on the Open LLM Leaderboard, demonstrating capability across reasoning, common sense, and language understanding tasks.


Model Overview

jan-hq/supermario-slerp is a 7-billion-parameter language model developed by jan-hq as a test project for model merging. It merges two existing models, Seraph-7B and Marcoroni-7B-v3, using the Slerp (spherical linear interpolation) method, with Mistral-7B-v0.1 as the base model.
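Slerp interpolates between two models' weight tensors along the arc between them rather than along a straight line, which preserves each parent's weight geometry better than plain linear averaging. The NumPy sketch below illustrates the underlying formula only; the function name, the per-tensor treatment, and the interpolation factor `t` are assumptions for illustration, not the exact tooling or settings used to build this model.

```python
import numpy as np

def slerp(t: float, v0: np.ndarray, v1: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """Spherical linear interpolation between two weight tensors.

    t=0 returns v0 (e.g. one parent's tensor), t=1 returns v1
    (the other parent's tensor). Illustrative sketch only.
    """
    # Normalize copies to find the angle between the two weight directions.
    u0 = v0 / (np.linalg.norm(v0) + eps)
    u1 = v1 / (np.linalg.norm(v1) + eps)
    dot = np.clip(np.sum(u0 * u1), -1.0, 1.0)
    omega = np.arccos(dot)
    if omega < eps:
        # Nearly parallel tensors: fall back to linear interpolation.
        return (1.0 - t) * v0 + t * v1
    sin_omega = np.sin(omega)
    # Blend along the arc; coefficients sum to 1 at the endpoints.
    return (np.sin((1.0 - t) * omega) / sin_omega) * v0 + \
           (np.sin(t * omega) / sin_omega) * v1
```

In practice, a merge like this is applied tensor by tensor across both checkpoints, and the interpolation factor is often varied by layer type.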

Key Capabilities & Performance

This model demonstrates general language understanding and reasoning abilities, as evaluated on the Open LLM Leaderboard. Its performance metrics include:

  • Avg. Score: 72.32
  • ARC (25-shot): 68.94
  • HellaSwag (10-shot): 86.58
  • MMLU (5-shot): 64.93
  • TruthfulQA (0-shot): 60.11
  • Winogrande (5-shot): 81.29
  • GSM8K (5-shot): 72.10

Detailed evaluation results are available on the Open LLM Leaderboard.

Intended Use

This model serves as an example of a merged model built with the Slerp method. It can be run locally using Jan Desktop, an open-source, offline-first ChatGPT alternative. Jan provides a local server with OpenAI-compatible endpoints, keeping conversations and model settings private and under local control.
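As a minimal sketch, once Jan's local server is running with this model loaded, any OpenAI-compatible client can query it. The base URL below uses Jan's documented default port, and the model identifier is an assumption; check both against your Jan settings.

```python
from openai import OpenAI

# Point the standard OpenAI client at Jan's local server.
# Base URL/port are Jan's defaults; adjust to match your settings.
client = OpenAI(base_url="http://localhost:1337/v1", api_key="not-needed")

response = client.chat.completions.create(
    model="supermario-slerp",  # assumed model id; check Jan's model list
    messages=[{"role": "user", "content": "Explain Slerp model merging in one paragraph."}],
    temperature=0.7,
)
print(response.choices[0].message.content)
```

Because the local endpoint speaks the OpenAI API, existing tooling written against that API works against Jan without code changes.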