wang7776/Mistral-7B-Instruct-v0.2-sparsity-30-v0.1
Text Generation · Concurrency Cost: 1 · Model Size: 7B · Quant: FP8 · Ctx Length: 8k · Published: Jan 17, 2024 · License: apache-2.0 · Architecture: Transformer · Open Weights
wang7776/Mistral-7B-Instruct-v0.2-sparsity-30-v0.1 is a 7-billion-parameter instruction-tuned causal language model based on Mistral-7B-Instruct-v0.2, pruned to 30% sparsity with the Wanda method. Wanda requires no retraining or weight updates, and the pruned model aims to retain performance competitive with the dense original. It is designed for instruction-following tasks and inherits Mistral's grouped-query attention, sliding-window attention, and byte-fallback BPE tokenizer.
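Because Wanda's 30% sparsity is unstructured (weights are zeroed in place, not removed), the checkpoint should load like any standard Mistral-architecture model. A minimal sketch, assuming the repository ships ordinary transformers-format weights; the prompt and generation settings are illustrative, not taken from the card:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "wang7776/Mistral-7B-Instruct-v0.2-sparsity-30-v0.1"

tokenizer = AutoTokenizer.from_pretrained(repo)
# The 30% sparsity is unstructured, so the weights load as dense tensors;
# fp16 is a safe local default (the FP8 quant above refers to the hosted deployment).
model = AutoModelForCausalLM.from_pretrained(
    repo,
    torch_dtype=torch.float16,
    device_map="auto",
)

# Mistral-Instruct models use the [INST] ... [/INST] chat format;
# apply_chat_template builds it from the tokenizer's bundled template.
messages = [{"role": "user", "content": "Explain unstructured pruning in one sentence."}]
input_ids = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)

output = model.generate(input_ids, max_new_tokens=128, do_sample=True)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```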
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model.

temperature: –
top_p: –
top_k: –
frequency_penalty: –
presence_penalty: –
repetition_penalty: –
min_p: –
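These parameters map onto a standard sampling request. A hedged sketch, assuming Featherless exposes an OpenAI-compatible chat completions endpoint: the base URL, the extra_body pass-through for non-standard fields (top_k, min_p, repetition_penalty), and every value below are assumptions or placeholders, since the card's actual top-3 configs did not render in this capture:

```python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint; verify against the Featherless docs
    api_key="YOUR_FEATHERLESS_API_KEY",
)

response = client.chat.completions.create(
    model="wang7776/Mistral-7B-Instruct-v0.2-sparsity-30-v0.1",
    messages=[{"role": "user", "content": "Summarize the Wanda pruning method."}],
    # Placeholder values, not the card's (unrendered) popular configs.
    temperature=0.7,
    top_p=0.9,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Non-standard sampler fields are not part of the OpenAI schema; many
    # OpenAI-compatible servers accept them via the SDK's extra_body pass-through.
    extra_body={"top_k": 40, "min_p": 0.05, "repetition_penalty": 1.1},
)
print(response.choices[0].message.content)
```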