vmajor/Orca2-13B-selfmerge-26B
vmajor/Orca2-13B-selfmerge-26B is a 13-billion-parameter language model produced by merging Microsoft's Orca-2-13b with itself using mergekit-legacy. The self-merge yields a marginal perplexity improvement and more than doubles the base model's score on the GSM8K benchmark, indicating stronger mathematical reasoning. It is intended for tasks that demand better logical and arithmetic capabilities than the base model, while retaining the 4096-token context length.
Model Overview
vmajor/Orca2-13B-selfmerge-26B is a 13-billion-parameter language model created by applying a self-merge operation to the microsoft/Orca-2-13b model using mergekit-legacy with the parameters --weight 0.5 --density 0.5. This merging technique aims to enhance performance by recombining the model's own weights.
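For intuition, a plain weight-averaged self-merge could be sketched in PyTorch as below. This is a hypothetical simplification, not the author's exact procedure: mergekit-legacy's --density parameter implies TIES-style sparsification, which this sketch omits, and the output path name is illustrative.

```python
# Simplified sketch of a linear (weight-averaged) self-merge in PyTorch.
# NOTE: illustrative assumption only; the actual model was built with
# mergekit-legacy (--weight 0.5 --density 0.5), whose density-based
# sparsification is not reproduced here.
import torch
from transformers import AutoModelForCausalLM

model_a = AutoModelForCausalLM.from_pretrained(
    "microsoft/Orca-2-13b", torch_dtype=torch.float16
)
model_b = AutoModelForCausalLM.from_pretrained(
    "microsoft/Orca-2-13b", torch_dtype=torch.float16
)

# Average every tensor: 0.5 * A + 0.5 * B (--weight 0.5 applied to each copy).
state_b = model_b.state_dict()
merged = {
    name: 0.5 * tensor + 0.5 * state_b[name]
    for name, tensor in model_a.state_dict().items()
}

model_a.load_state_dict(merged)
model_a.save_pretrained("Orca2-13B-selfmerge")  # hypothetical output path
```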
Key Performance Improvements
The self-merged model shows a slight perplexity improvement, from 7.595 to 7.550 (lower is better). More significantly, benchmark results indicate a substantial gain in mathematical reasoning:
- GSM8K: Performance more than doubled, increasing from 17.97 to 39.2.
- Overall Average: The model achieved an average score of 62.24 across various benchmarks, compared to the base model's 58.64.
The remaining benchmarks (ARC, HellaSwag, MMLU, TruthfulQA, and Winogrande) show only minor changes, a mix of slight gains and negligible dips. The standout differentiator is the GSM8K boost, which suggests enhanced arithmetic and logical problem-solving capabilities.
Use Cases
This model is particularly well-suited for applications where improved mathematical reasoning and problem-solving are critical, building upon the strong foundation of the Orca-2-13B architecture.
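As a minimal sketch of basic inference through the standard Hugging Face transformers API, assuming the prompt wording and generation settings below (they are illustrative, not taken from the model card):

```python
# Minimal inference sketch using the transformers API.
# The prompt and generation settings are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "vmajor/Orca2-13B-selfmerge-26B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# A GSM8K-style word problem to exercise the improved math reasoning.
prompt = (
    "A bakery sells 24 muffins per tray. It bakes 7 trays and sells all "
    "but 13 muffins. How many muffins did it sell? Think step by step."
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Greedy decoding (do_sample=False) is used here because deterministic output is usually preferable for arithmetic problems; the 4096-token context leaves ample room for multi-step reasoning.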