prognosis/cardio-llama-2-7b-miniguanaco-v13

Text Generation · Concurrency Cost: 1 · Model Size: 7B · Quant: FP8 · Ctx Length: 4k · Architecture: Transformer

The prognosis/cardio-llama-2-7b-miniguanaco-v13 model is a 7B-parameter Llama 2-based language model, fine-tuned with 4-bit quantization (nf4 quantization type, float16 compute dtype) and PEFT 0.4.0 for parameter-efficient adaptation. While its primary use cases are not documented, the training configuration points to resource-efficient deployment and inference, potentially for specialized applications where computational constraints are a factor.


Overview

The prognosis/cardio-llama-2-7b-miniguanaco-v13 is a Llama 2-based language model fine-tuned with a focus on efficient resource utilization. Training employed 4-bit quantization with the nf4 quantization type and float16 compute dtype, a common strategy for reducing memory footprint and accelerating inference on compatible hardware.
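
These settings map directly onto the BitsAndBytesConfig API in Hugging Face transformers. The sketch below shows how the model could be loaded with the quantization parameters named on this card; the repository id comes from the card itself, while the device placement and other loading arguments are assumptions rather than the authors' published configuration.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# 4-bit quantization settings matching the card: nf4 quant type with a
# float16 compute dtype (assumed to mirror the training configuration).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

model_id = "prognosis/cardio-llama-2-7b-miniguanaco-v13"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",  # assumption: spread layers across available devices
)
```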

Key Capabilities

  • Efficient Quantization: Trained with bitsandbytes 4-bit quantization (bnb_4bit_quant_type: nf4, bnb_4bit_compute_dtype: float16), making it suitable for environments with limited computational resources.
  • PEFT Integration: Leverages PEFT (Parameter-Efficient Fine-Tuning) version 0.4.0, indicating an efficient fine-tuning approach that minimizes the number of trainable parameters (a configuration sketch follows this list).
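
The PEFT 0.4.0 dependency suggests a LoRA-style adapter workflow on top of the quantized base. The card does not disclose the actual adapter hyperparameters, so the rank, alpha, dropout, and target modules below are illustrative assumptions, not the authors' configuration.

```python
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Prepare the 4-bit base model (from the previous sketch) for training.
model = prepare_model_for_kbit_training(model)

# Hypothetical LoRA settings -- the card does not publish the real values.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "v_proj"],  # a common choice for Llama models
)

peft_model = get_peft_model(model, lora_config)
peft_model.print_trainable_parameters()  # shows how few parameters train
```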

Good for

  • Resource-Constrained Deployment: Ideal for applications requiring a Llama 2-based model with a reduced memory footprint and faster inference due to 4-bit quantization (see the inference sketch after this list).
  • Experimentation with Quantized Models: Provides a base for further research or application development involving highly quantized language models.
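
For completeness, a minimal generation call with the model loaded as in the first sketch. The prompt template is an assumption: Llama 2 fine-tunes often use the [INST] format, but the card does not confirm which template this model was trained on, so verify against the training data before relying on it.

```python
# Minimal inference sketch; the [INST] template and the sample question
# are assumptions for illustration, not confirmed by this card.
prompt = "[INST] What are common risk factors for coronary artery disease? [/INST]"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

output_ids = model.generate(
    **inputs,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.7,
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```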