Mel-Iza0/Mistral-base-instruct
Mel-Iza0/Mistral-base-instruct is a fine-tuned version of the Mistral-7B-Instruct-v0.1 model, developed by Mel-Iza0. This model is based on the 7 billion parameter Mistral architecture, which is known for its efficiency and strong performance in its size class. While specific training data and primary use cases are not detailed, it is an instruction-tuned model, suggesting general-purpose conversational and instruction-following capabilities.
Overview
Mel-Iza0/Mistral-base-instruct is a fine-tuned language model derived from the mistralai/Mistral-7B-Instruct-v0.1 base model. It leverages the efficient 7 billion parameter Mistral architecture, which is recognized for strong performance across a range of natural language processing tasks. The fine-tuning hyperparameters are documented below, but the dataset used for fine-tuning is not specified.
Key Capabilities
- Instruction Following: As an instruction-tuned model, it is designed to understand and execute user instructions effectively (see the usage sketch after this list).
- General-Purpose Language Tasks: Inherits the broad capabilities of the Mistral-7B-Instruct-v0.1 base model, suitable for a range of text generation and understanding tasks.
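A minimal usage sketch is shown below. It assumes the model inherits the chat template and generation interface of its Mistral-7B-Instruct-v0.1 base model through the transformers library; the prompt and generation settings are illustrative only, not part of the model card.

```python
# Minimal inference sketch; assumes the tokenizer inherits the base
# model's chat template. Prompt and generation settings are illustrative.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Mel-Iza0/Mistral-base-instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision to fit a 7B model on one GPU
    device_map="auto",
)

messages = [{"role": "user", "content": "Explain gradient accumulation in one paragraph."}]
input_ids = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=256, do_sample=False)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```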
Training Details
The model was trained using the following key hyperparameters:
- Learning Rate: 0.0004
- Batch Size: 2 (train), 8 (eval)
- Gradient Accumulation: 2 steps, resulting in a total train batch size of 4
- Optimizer: Adam with standard betas and epsilon
- Scheduler: Constant with warmup (ratio 0.03)
- Training Steps: 5
- Mixed Precision: Native AMP was utilized for training efficiency.
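For reference, a hypothetical reconstruction of this configuration with the Hugging Face TrainingArguments API might look like the following. Since the dataset, model preparation, and any adapter setup are not documented, this is a sketch of the reported hyperparameters only, not the author's actual training script.

```python
# Hypothetical reconstruction of the reported hyperparameters using the
# Hugging Face Trainer API. The dataset and any PEFT/LoRA setup are not
# documented, so only the training arguments are shown.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="mistral-base-instruct-finetune",  # placeholder path
    learning_rate=4e-4,
    per_device_train_batch_size=2,
    per_device_eval_batch_size=8,
    gradient_accumulation_steps=2,  # 2 x 2 = effective train batch size of 4
    lr_scheduler_type="constant_with_warmup",
    warmup_ratio=0.03,
    max_steps=5,  # matches the reported 5 training steps
    fp16=True,    # native AMP mixed precision
)
```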
Good for
- Developers looking for a fine-tuned Mistral-7B-Instruct-v0.1 variant.
- Experimentation with instruction-following models.
- Applications requiring a balance of performance and computational efficiency from a 7B parameter model.