wang7776/vicuna-7b-v1.3-sparsity-10

Text Generation · Concurrency Cost: 1 · Model Size: 7B · Quant: FP8 · Ctx Length: 4K · Published: Jan 16, 2024 · License: apache-2.0 · Architecture: Transformer · Open Weights

wang7776/vicuna-7b-v1.3-sparsity-10 is a 7-billion-parameter auto-regressive language model, fine-tuned from LLaMA by LMSYS, that has been pruned to 10% sparsity with the Wanda method. Wanda zeroes out weights without retraining or weight updates, aiming to keep performance competitive with the dense model. The checkpoint is intended primarily for research and development in large language models and chatbots.


Overview

This model, wang7776/vicuna-7b-v1.3-sparsity-10, is a variant of the 7-billion-parameter Vicuna v1.3 model from LMSYS: an auto-regressive language model fine-tuned from LLaMA on approximately 125K user-shared conversations collected from ShareGPT.com. What distinguishes this checkpoint is the Wanda pruning method, which achieves 10% sparsity without retraining or weight updates while aiming to preserve the dense model's performance.
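
The checkpoint can be used like any standard Hugging Face causal LM. Below is a minimal loading-and-inference sketch with the transformers library; the generation settings, the example question, and the fp16/single-GPU assumptions are illustrative, and Vicuna v1.3 is generally prompted with the "USER: ... ASSISTANT:" conversation format shown here.

```python
# Minimal sketch: load the checkpoint and run one turn of chat.
# Assumptions: fp16 weights fit on the available accelerator(s);
# sampling parameters are illustrative, not tuned.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "wang7776/vicuna-7b-v1.3-sparsity-10"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto",
)

# Vicuna v1.3 conversation format: system preamble, then USER/ASSISTANT turns.
prompt = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the user's "
    "questions. USER: What is weight pruning in neural networks? ASSISTANT:"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
# Decode only the newly generated tokens, not the prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```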

Key Capabilities

  • Chat Assistant: Functions as a chat assistant, fine-tuned on conversational data.
  • Sparsity: Incorporates 10% sparsity via Wanda pruning, potentially offering efficiency benefits (the pruning criterion is sketched after this list).
  • Research Tool: Primarily designed for research and development in large language models and chatbots.
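
To make the Wanda criterion concrete, here is a minimal sketch of how it scores and removes weights in a single linear layer, following Sun et al. (2023): each weight is ranked by its magnitude scaled by the L2 norm of the corresponding input feature over a calibration batch, and the lowest-scoring fraction is zeroed per output row. The function name, shapes, and calibration data below are illustrative, not the authors' implementation.

```python
# Minimal sketch of the Wanda pruning criterion for one linear layer.
import torch

def wanda_prune(weight: torch.Tensor, activations: torch.Tensor,
                sparsity: float = 0.10) -> torch.Tensor:
    """Zero the lowest-scoring fraction of weights, per output row.

    weight:      (out_features, in_features) matrix of a linear layer
    activations: (num_tokens, in_features) calibration inputs to that layer
    """
    # Wanda score: |weight| scaled by the L2 norm of each input feature.
    feature_norms = activations.norm(p=2, dim=0)        # (in_features,)
    scores = weight.abs() * feature_norms.unsqueeze(0)  # (out, in)

    # Within each output row, zero the num_prune lowest-scoring weights.
    num_prune = int(weight.shape[1] * sparsity)
    pruned = weight.clone()
    if num_prune > 0:
        _, idx = torch.topk(scores, num_prune, dim=1, largest=False)
        pruned.scatter_(1, idx, 0.0)
    return pruned

# Example with random weights and fake calibration data.
W = torch.randn(4096, 4096)
X = torch.randn(512, 4096)
W_sparse = wanda_prune(W, X, sparsity=0.10)
print(f"sparsity: {(W_sparse == 0).float().mean():.2%}")
```

Because the score is computed from a forward pass over a small calibration set, no gradient computation, retraining, or weight update is needed, which is what makes the method cheap to apply to a 7B model.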

Good For

  • LLM Research: Ideal for researchers and hobbyists exploring sparse models and their performance characteristics.
  • Chatbot Development: Suitable for experimenting with conversational AI applications based on the Vicuna architecture.
  • Efficiency Studies: Useful for investigating the impact of pruning techniques like Wanda on model size and inference efficiency without significant performance degradation (a sketch for verifying the checkpoint's actual sparsity follows).
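
For such studies, a first sanity check is to confirm the fraction of zeroed weights in the loaded checkpoint. A minimal sketch is below; restricting the count to torch.nn.Linear layers is an assumption, since pruning methods like Wanda typically target the linear projections and leave embeddings and norms dense.

```python
# Minimal sketch: measure overall weight sparsity of the linear layers.
import torch
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "wang7776/vicuna-7b-v1.3-sparsity-10", torch_dtype=torch.float16
)

total, zeros = 0, 0
for name, module in model.named_modules():
    if isinstance(module, torch.nn.Linear):
        w = module.weight
        total += w.numel()
        zeros += (w == 0).sum().item()

print(f"zeroed weights in Linear layers: {zeros / total:.2%}")
```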