Name: mlfoundations-dev/oh_v1_w_v3_camel_math_gpt-4o-mini API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: mlfoundations-dev

Model Overview

The mlfoundations-dev/oh_v1_w_v3_camel_math_gpt-4o-mini is an 8 billion parameter language model derived from the Meta Llama 3.1-8B architecture. It has undergone fine-tuning on the mlfoundations-dev/oh_v1_w_v3_camel_math_gpt-4o-mini dataset, indicating a specialized focus on mathematical reasoning and problem-solving tasks.

Key Characteristics

Base Model: Fine-tuned from meta-llama/Llama-3.1-8B.
Parameter Count: 8 billion parameters.
Context Length: Supports a context length of 32768 tokens.
Optimization: Specifically trained on a dataset geared towards mathematical content, suggesting enhanced performance in numerical and logical operations.
Performance: Achieved a final validation loss of 0.5828 during training.

Training Details

The model was trained with a learning rate of 5e-06, using Adam optimizer with betas=(0.9, 0.999) and epsilon=1e-08. It utilized a distributed training setup across 16 devices with a total batch size of 512, over 3 epochs. The training process included a constant learning rate scheduler with a warmup ratio of 0.1.

Intended Use Cases

This model is particularly well-suited for applications requiring strong mathematical understanding and generation. Potential use cases include:

Solving mathematical problems.
Generating explanations for mathematical concepts.
Assisting in data analysis and quantitative reasoning tasks.

Due to its specialized fine-tuning, it is expected to perform effectively in environments where precise numerical and logical processing is critical.

Overview

Model Overview

Key Characteristics

Training Details

Intended Use Cases

Full Model Card (README)