Gille/StrangeMerges_52-7B-dare_ties
Text Generation | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 8k | Published: Apr 1, 2024 | License: apache-2.0 | Architecture: Transformer | Open Weights | Cold

Gille/StrangeMerges_52-7B-dare_ties is a 7-billion-parameter language model created by Gille. It merges several specialized models, including WizardMath-7B-V1.1 and Einstein-v4-7B, using the dare_ties method, with the aim of combining their strengths in mathematical reasoning and general language understanding. It achieves an average score of 73.51 on the Open LLM Leaderboard, with notable results on the HellaSwag and GSM8k benchmarks, which makes it a candidate for reasoning-heavy tasks.
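For readers unfamiliar with dare_ties, the toy sketch below illustrates the general idea on plain Python lists: each fine-tuned model's difference from a shared base is randomly sparsified and rescaled (the DARE step), then a sign is elected per parameter and only the agreeing differences are averaged (the TIES step). This is a simplified illustration, not the recipe used to build this model. The function names, the `density` value, the unweighted averaging, and the tie-breaking toward a positive sign are all assumptions made for the example.

```python
import random


def dare_sparsify(delta, density, rng):
    """DARE step (sketch): keep each entry with probability `density`
    and rescale the survivors by 1/density; dropped entries become 0."""
    return [d / density if rng.random() < density else 0.0 for d in delta]


def ties_merge(deltas):
    """TIES step (sketch): elect a sign per position from the summed deltas,
    then average only the nonzero entries that agree with that sign."""
    merged = []
    for column in zip(*deltas):
        sign = 1.0 if sum(column) >= 0 else -1.0  # assumption: ties go positive
        agreeing = [v for v in column if v != 0.0 and v * sign > 0]
        merged.append(sum(agreeing) / len(agreeing) if agreeing else 0.0)
    return merged


def dare_ties(base, finetuned, density=0.5, seed=0):
    """Merge several fine-tuned parameter vectors onto a shared base vector."""
    rng = random.Random(seed)
    deltas = [
        dare_sparsify([f - b for f, b in zip(ft, base)], density, rng)
        for ft in finetuned
    ]
    return [b + d for b, d in zip(base, ties_merge(deltas))]


# Hypothetical three-parameter "models"; density=1.0 disables random dropping.
base = [0.0, 1.0, 2.0]
ft_a = [1.0, 1.0, 1.0]
ft_b = [3.0, 0.0, 3.0]
merged = dare_ties(base, [ft_a, ft_b], density=1.0)
```

With `density=1.0` nothing is dropped, so the result depends only on the sign election. Lower densities discard more of each difference before merging, which is intended to reduce interference between the source models.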


Popular Sampler Settings

The top 3 parameter combinations used by Featherless users for this model. Each configuration sets the following parameters:

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p
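
As a rough guide to what the parameters listed above control, the sketch below applies them to a small list of logits in pure Python. It is an illustration under stated assumptions, not the Featherless implementation: the function names are hypothetical, the order in which the filters are applied varies between inference engines, and combining the multiplicative `repetition_penalty` with the additive `frequency_penalty` and `presence_penalty` in one function is a simplification. The numeric values are made up and are not the popular settings for this model.

```python
import math


def apply_penalties(logits, counts, frequency_penalty=0.0,
                    presence_penalty=0.0, repetition_penalty=1.0):
    """Lower the logits of tokens that already appeared `counts[i]` times."""
    out = []
    for logit, count in zip(logits, counts):
        if count > 0:
            # Multiplicative penalty: shrink positive logits, push negatives lower.
            logit = logit / repetition_penalty if logit > 0 else logit * repetition_penalty
            # Additive penalties: one scales with the count, one is a flat charge.
            logit -= frequency_penalty * count + presence_penalty
        out.append(logit)
    return out


def filtered_distribution(logits, temperature=1.0, top_k=0, top_p=1.0, min_p=0.0):
    """Return {token_index: probability} after temperature, top_k, top_p, min_p."""
    scaled = [l / temperature for l in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]  # subtract peak for stability
    total = sum(exps)
    probs = [e / total for e in exps]

    # Rank tokens from most to least likely.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    if top_k > 0:
        order = order[:top_k]  # top_k: keep only the k most likely tokens

    # top_p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # min_p: drop tokens less likely than min_p times the most likely token.
    threshold = min_p * probs[order[0]]
    kept = [i for i in kept if probs[i] >= threshold]

    mass = sum(probs[i] for i in kept)
    return {i: probs[i] / mass for i in kept}  # renormalize the survivors


# Illustrative values only.
logits = [2.0, 1.0, 0.0, -1.0]
penalized = apply_penalties(logits, counts=[1, 0, 0, 0], repetition_penalty=2.0)
dist = filtered_distribution(logits, temperature=1.0, top_k=2)
```

A sampler would then draw the next token from the returned distribution. Lower `temperature` sharpens it, while `top_k`, `top_p`, and `min_p` each cut off the unlikely tail in a different way.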