mlfoundations-dev/openthoughts3_30k_llama3

Text generation · Concurrency cost: 1 · Model size: 8B · Quantization: FP8 · Context length: 32k · License: llama3.1 · Architecture: Transformer

mlfoundations-dev/openthoughts3_30k_llama3 is an 8-billion-parameter causal language model fine-tuned from Meta's Llama-3.1-8B-Instruct. It was trained on the mlfoundations-dev/openthoughts3_30k dataset, which suggests a specialization in the open-ended reasoning tasks or domain knowledge that dataset captures. With a 32768-token context length, it is suitable for applications requiring extensive contextual understanding.


Overview

openthoughts3_30k_llama3 is an 8-billion-parameter language model developed by mlfoundations-dev. It is a fine-tuned variant of the meta-llama/Llama-3.1-8B-Instruct base model, inheriting its architecture and instruction-following capabilities. Fine-tuning used the mlfoundations-dev/openthoughts3_30k dataset, indicating a likely specialization in generating or processing the kind of "open thoughts" content or knowledge domains represented in that dataset.

Training Details

The model was trained for 5 epochs with a learning rate of 4e-05 and a total batch size of 128 (32 devices × 4 gradient accumulation steps, implying a per-device batch size of 1). It used the AdamW optimizer and a cosine learning rate scheduler with a 0.1 warmup ratio. The training environment included Transformers 4.46.1, PyTorch 2.3.0, Datasets 3.1.0, and Tokenizers 0.20.3.
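For reference, the reported hyperparameters can be collected into a single configuration dict, sketched below. The values come from this card; the dict layout and the per-device batch size of 1 (derived from 128 / (32 × 4)) are illustrative assumptions, not the authors' actual training script.

```python
# Hedged reconstruction of the reported training hyperparameters.
# per_device_train_batch_size = 1 is inferred: 128 total / (32 devices * 4 accum).
train_config = {
    "learning_rate": 4e-05,
    "per_device_train_batch_size": 1,   # assumption, see comment above
    "num_devices": 32,
    "gradient_accumulation_steps": 4,
    "num_train_epochs": 5,
    "optim": "adamw",
    "lr_scheduler_type": "cosine",
    "warmup_ratio": 0.1,
}

# Effective (total) batch size = per-device batch * devices * accumulation steps.
effective_batch = (
    train_config["per_device_train_batch_size"]
    * train_config["num_devices"]
    * train_config["gradient_accumulation_steps"]
)
print(effective_batch)  # 128
```

These names mirror Hugging Face `TrainingArguments` fields, so the dict could be passed as keyword arguments to a Trainer-based fine-tuning script with minor adaptation.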

Key Characteristics

  • Base Model: Meta Llama-3.1-8B-Instruct
  • Parameter Count: 8 billion
  • Context Length: 32768 tokens
  • Fine-tuning Dataset: mlfoundations-dev/openthoughts3_30k

Potential Use Cases

Given its fine-tuning on the openthoughts3_30k dataset, this model is likely well-suited for tasks that align with the nature of that dataset, potentially including:

  • Generating creative text or ideas
  • Summarizing or analyzing open-ended discussions
  • Applications requiring deep contextual understanding over long inputs
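The card does not include a usage snippet; a minimal sketch of querying the model with the Hugging Face transformers library might look like the following. The `generate_reply` helper is illustrative (not part of this release), and actually running it requires the `transformers` and `torch` packages plus enough GPU memory for the 8B weights.

```python
MODEL_ID = "mlfoundations-dev/openthoughts3_30k_llama3"

def generate_reply(prompt: str, max_new_tokens: int = 256) -> str:
    """Illustrative helper: load the fine-tuned model and generate a reply.

    Imports are deferred so this file parses without transformers installed;
    calling the function downloads the model (roughly 16 GB in bf16).
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16, device_map="auto"
    )
    # Llama-3.1-Instruct derivatives ship a chat template; use it rather than
    # hand-building the prompt string.
    messages = [{"role": "user", "content": prompt}]
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    output = model.generate(inputs, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, skipping the echoed prompt.
    return tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True)
```

Because the model keeps the base Instruct chat template, any serving stack that speaks that format (vLLM, TGI, etc.) should work without prompt changes.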