Gille/StrangeMerges_51-7B-dare_ties
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Apr 1, 2024License:apache-2.0Architecture:Transformer Open Weights Cold
Gille/StrangeMerges_51-7B-dare_ties is a 7 billion parameter language model created by Gille, formed by merging several specialized models including WizardMath-7B-V1.1, NeuralCoder-7b, and Einstein-v4-7B using the dare_ties method. This merge aims to combine strengths in mathematical reasoning, code generation, and general language understanding. It is designed for diverse applications requiring a blend of these capabilities, offering a context length of 8192 tokens.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p