mlfoundations-dev/OH_DCFT_V3_wo_gpt4_llm
mlfoundations-dev/OH_DCFT_V3_wo_gpt4_llm is an 8-billion-parameter language model fine-tuned from meta-llama/Llama-3.1-8B. It was trained on the mlfoundations-dev/OH_DCFT_V3_wo_gpt4_llm dataset, reaching a final validation loss of 0.6373, and supports a context length of 32,768 tokens. Further details on its specific capabilities and intended uses have not yet been provided.
Overview
OH_DCFT_V3_wo_gpt4_llm is an 8-billion-parameter language model developed by mlfoundations-dev. It is a fine-tuned variant of the meta-llama/Llama-3.1-8B base model, trained on the mlfoundations-dev/OH_DCFT_V3_wo_gpt4_llm dataset.
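For reference, the checkpoint can be loaded with the standard Hugging Face transformers API. A minimal sketch, assuming the repository contains the usual config and tokenizer files; the dtype and device settings are illustrative choices, not documented requirements:

```python
# Minimal sketch: loading the fine-tuned checkpoint with Hugging Face transformers.
# Assumes the weights are hosted at mlfoundations-dev/OH_DCFT_V3_wo_gpt4_llm with
# standard tokenizer/config files; adjust dtype and device for your hardware.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mlfoundations-dev/OH_DCFT_V3_wo_gpt4_llm"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # bf16 is a common choice for Llama 3.1 8B inference
    device_map="auto",
)

prompt = "Explain the difference between pretraining and supervised fine-tuning."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```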
Training Details
The model was fine-tuned for 3 epochs, reaching a final validation loss of 0.6373. Key hyperparameters:
- learning rate: 5e-06, with a constant schedule and a warmup ratio of 0.1
- total batch size: 512, distributed across 16 GPUs
- optimizer: Adam with betas=(0.9, 0.999) and epsilon=1e-08
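These settings map directly onto a transformers TrainingArguments configuration. A minimal sketch, assuming a typical per-device batch size and gradient-accumulation split; only the total effective batch size of 512 across 16 GPUs is documented:

```python
# Sketch of a TrainingArguments setup mirroring the reported hyperparameters.
# The per-device batch size / gradient accumulation split is an assumption:
# only the effective batch size of 512 across 16 GPUs is documented
# (e.g. 8 per device * 16 GPUs * 4 accumulation steps = 512).
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="./OH_DCFT_V3_wo_gpt4_llm",
    num_train_epochs=3,
    learning_rate=5e-06,
    lr_scheduler_type="constant_with_warmup",  # constant schedule after warmup
    warmup_ratio=0.1,
    per_device_train_batch_size=8,       # assumed split: 8 * 16 GPUs * 4 = 512
    gradient_accumulation_steps=4,       # assumed, see comment above
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-08,
    bf16=True,                           # assumed; typical for Llama 3.1 fine-tuning
)
```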
Current Status
As of the current release, documentation of the model's intended uses, limitations, and evaluation results is not yet available. Users are encouraged to consult future updates for information on its performance characteristics and recommended applications.