Name: nbeerbower/mistral-nemo-gutenberg-12B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: nbeerbower

Model Overview

nbeerbower/mistral-nemo-gutenberg-12B is a 12 billion parameter language model derived from the mistralai/Mistral-Nemo-Instruct-2407 base model. It has been fine-tuned using an A100 GPU on Google Colab for one epoch, leveraging the jondurbin/gutenberg-dpo-v0.1 dataset. This fine-tuning process aims to enhance its instruction-following capabilities and general language understanding.

Key Characteristics

Base Model: Mistral-Nemo-Instruct-2407 architecture.
Parameter Count: 12 billion parameters.
Context Length: Supports a substantial context window of 32768 tokens.
Training Data: Fine-tuned on the gutenberg-dpo-v0.1 dataset, which likely contributes to its text generation and comprehension abilities.

Performance Metrics

Evaluated on the Open LLM Leaderboard, the model achieved an average score of 20.82. Specific benchmark results include:

IFEval (0-Shot): 35.04
BBH (3-Shot): 32.43
MMLU-PRO (5-shot): 28.47

Use Cases

This model is suitable for developers looking for a moderately sized language model with good instruction-following capabilities, particularly for tasks that benefit from a large context window. Its fine-tuning on a specific dataset suggests potential strengths in areas related to the dataset's content.

Overview

Model Overview

Key Characteristics

Performance Metrics

Use Cases

Full Model Card (README)