Name: RefalMachine/ruadapt_solar_10.7_part1 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: RefalMachine

Model Overview

RefalMachine/ruadapt_solar_10.7_part1 is a fine-tuned language model derived from the SOLAR architecture. It was developed by RefalMachine and represents a specialized adaptation of its base model, solar_darulm_unigram_proj_init_17_01_24.

Training Details

The model underwent a single epoch of fine-tuning with a learning rate of 2e-05, utilizing a distributed training setup across 16 devices. Key hyperparameters included a train_batch_size of 1, eval_batch_size of 1, and a gradient_accumulation_steps of 8, resulting in an effective total_train_batch_size of 128. The optimizer used was Adam with betas=(0.9, 0.95) and epsilon=1e-05, employing a linear learning rate scheduler and native AMP for mixed-precision training.

Performance Metrics

During evaluation, the model achieved a final validation loss of 2.3397 and an accuracy of 0.5164. The training process showed a gradual decrease in validation loss and an increase in accuracy over 40,500 steps, indicating progressive learning.

Intended Uses & Limitations

Specific intended uses and limitations are not detailed in the provided information. Developers should conduct further evaluation to determine suitability for particular applications. The model's performance metrics suggest it may be applicable for tasks aligned with its fine-tuning dataset, though the dataset itself is not specified.

Overview

Model Overview

Training Details

Performance Metrics

Intended Uses & Limitations

Full Model Card (README)