mariiaponom/llama_13b_class
Text Generation · Concurrency Cost: 1 · Model Size: 13B · Quant: FP8 · Ctx Length: 4k · Architecture: Transformer

The mariiaponom/llama_13b_class model is a Llama-based language model, likely around 13 billion parameters, trained with 4-bit quantization via the bitsandbytes library. It uses nf4 (NormalFloat4) quantization with a bfloat16 compute dtype for memory-efficient training. The model is primarily characterized by this training methodology, which emphasizes efficient resource utilization through quantization.
