Weni/WeniGPT-Mistral-7B-instructBase
WeniGPT-Mistral-7B-instructBase is a fine-tuned variant of the Mistral-7B-Instruct-v0.1 model, developed by Weni. This instruction-tuned model leverages the Mistral architecture, known for its efficiency and strong performance in its size class. It was fine-tuned with a learning rate of 0.0004 over 8000 training steps and is intended for general instruction-following tasks.
WeniGPT-Mistral-7B-instructBase Overview
WeniGPT-Mistral-7B-instructBase is an instruction-tuned language model developed by Weni, based on the robust mistralai/Mistral-7B-Instruct-v0.1 architecture. While specific details about the fine-tuning dataset are not provided, the model is designed to follow instructions effectively, building upon the strong foundation of the original Mistral 7B Instruct model.
Training Details
The model underwent a fine-tuning process using the following key hyperparameters:
- Learning Rate: 0.0004
- Optimizer: Adam with betas=(0.9, 0.999) and epsilon=1e-08
- Training Steps: 8000
- Batch Size: a train_batch_size of 2, with gradient_accumulation_steps of 2, resulting in a total_train_batch_size of 4
- Mixed Precision: Native AMP was utilized for training efficiency
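As a sketch, the hyperparameters above could be expressed as a Hugging Face-style training configuration. The key names follow the transformers TrainingArguments convention and are illustrative only; Weni's actual training script is not published.

```python
# Illustrative fine-tuning configuration mirroring the reported hyperparameters.
# Key names follow Hugging Face TrainingArguments conventions (an assumption;
# the original training setup is not documented).
training_config = {
    "learning_rate": 4e-4,             # reported learning rate of 0.0004
    "adam_beta1": 0.9,                 # Adam with betas=(0.9, 0.999)
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-08,
    "max_steps": 8000,                 # reported training steps
    "per_device_train_batch_size": 2,  # train_batch_size of 2
    "gradient_accumulation_steps": 2,
    "fp16": True,                      # native AMP mixed precision
}

# The effective (total) train batch size is the product of the per-device
# batch size and the gradient accumulation steps: 2 * 2 = 4.
total_train_batch_size = (
    training_config["per_device_train_batch_size"]
    * training_config["gradient_accumulation_steps"]
)
```

Gradient accumulation lets a small per-device batch emulate a larger effective batch at the cost of more forward/backward passes per optimizer step, which is a common choice when fine-tuning 7B-class models on limited GPU memory.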
Intended Use Cases
Given its instruction-tuned nature and foundation on Mistral-7B-Instruct-v0.1, this model is generally suitable for a variety of natural language processing tasks that require understanding and executing commands. While specific use cases are not detailed in the provided information, it can reasonably be expected to perform well in areas such as:
- Question answering
- Text summarization
- Content generation based on prompts
- Chatbot applications requiring instruction following
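For tasks like those above, inference would typically follow the prompt format of the base Mistral-7B-Instruct-v0.1 model, which wraps user instructions in [INST] tags. The helper below is a minimal sketch; it assumes the fine-tune preserved the base model's chat template, which should be verified against the model's tokenizer before use.

```python
def build_instruct_prompt(instruction: str) -> str:
    """Wrap a user instruction in the Mistral-Instruct [INST] format.

    Assumption: WeniGPT-Mistral-7B-instructBase kept the prompt template of
    the base Mistral-7B-Instruct-v0.1 model. Prefer the tokenizer's built-in
    chat template (tokenizer.apply_chat_template) if one is available.
    """
    return f"<s>[INST] {instruction.strip()} [/INST]"

prompt = build_instruct_prompt("Summarize the main findings in two sentences.")
```

With the transformers library, this prompt would then be tokenized and passed to model.generate(); relying on the tokenizer's own chat template, when present, is the safer path than hand-building the string.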
Further evaluation would be needed to determine its specific strengths and limitations compared to other instruction-tuned models.