wang7776/Mistral-7B-Instruct-v0.2-sparsity-30-v0.1
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Jan 17, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

The wang7776/Mistral-7B-Instruct-v0.2-sparsity-30-v0.1 is a 7 billion parameter instruction-tuned causal language model, based on Mistral-7B-Instruct-v0.2, that has been pruned to 30% sparsity using the Wanda method. This pruning technique requires no retraining or weight updates, aiming to maintain competitive performance with reduced model size. It is designed for instruction-following tasks, leveraging grouped-query attention, sliding-window attention, and a byte-fallback BPE tokenizer.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p