kei0902/fine-tuned-gemma

Text generation · Model size: 2.6B · Quantization: BF16 · Context length: 8k · License: gemma · Architecture: Transformer

kei0902/fine-tuned-gemma is a 2.6 billion parameter language model fine-tuned from Google's gemma-2-2b-jpn-it. It was trained for 3 epochs at a learning rate of 2e-05 with mixed-precision training. Because the fine-tuning dataset is not documented, the model's specific differentiators and primary use cases are unknown.


Model Overview

This model, kei0902/fine-tuned-gemma, is a 2.6 billion parameter language model fine-tuned from Google's gemma-2-2b-jpn-it, a Japanese instruction-tuned variant of Gemma 2 2B. The dataset used for fine-tuning is not documented.
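
The card ships no usage instructions, so the snippet below is only a minimal inference sketch. It assumes the repository keeps the standard Gemma 2 chat setup of its base model; the Japanese prompt is illustrative.

```python
# Minimal inference sketch using Hugging Face transformers.
# Assumes the repo follows the standard Gemma 2 setup of its base model
# (gemma-2-2b-jpn-it); the prompt below is illustrative only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "kei0902/fine-tuned-gemma"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # the card lists BF16 weights
    device_map="auto",
)

# Gemma 2 instruction-tuned models use a chat template.
messages = [{"role": "user", "content": "自己紹介をしてください。"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```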

Training Details

The fine-tuning procedure used the following hyperparameters; a configuration sketch follows the list.

  • Learning Rate: 2e-05
  • Batch Size: train_batch_size of 1 and eval_batch_size of 8; with gradient_accumulation_steps of 8, the effective total_train_batch_size is 8 (1 × 8)
  • Optimizer: AdamW with default betas and epsilon
  • LR Scheduler: linear
  • Epochs: 3
  • Mixed Precision: native AMP
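
For concreteness, the reported hyperparameters map onto a Hugging Face transformers TrainingArguments configuration as sketched below. This is a reconstruction, not the author's script: the output directory, the optimizer name, and the choice of bf16 for the native-AMP setting are assumptions, and the dataset and Trainer wiring are omitted because the card does not specify them.

```python
# Sketch of TrainingArguments matching the hyperparameters reported above.
# Dataset, model/tokenizer setup, and Trainer wiring are omitted because
# the card does not document them; names here are illustrative.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="fine-tuned-gemma",     # illustrative path, not from the card
    learning_rate=2e-05,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=8,
    gradient_accumulation_steps=8,     # effective train batch size: 1 x 8 = 8
    num_train_epochs=3,
    lr_scheduler_type="linear",
    optim="adamw_torch",               # AdamW with default betas/epsilon
    bf16=True,                         # assumed native-AMP dtype, matching the BF16 weights
)
```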

Limitations

Because the fine-tuning dataset and training objectives are undocumented, the intended uses and limitations of this model are not clearly defined. Independent evaluation of its performance characteristics would be required before choosing it for any application.