# Tensoic/Llama-2-7B-alpaca-2k-test-merged
Tensoic/Llama-2-7B-alpaca-2k-test-merged is a 7 billion parameter Llama-2-based causal language model fine-tuned by Tensoic. It was fine-tuned with PEFT LoRA on the alpaca_2k_test dataset, making it suitable for instruction-following tasks. The model uses an extended context length of 4096 tokens and was trained with 8-bit quantization for efficiency.
## Model Overview
Tensoic/Llama-2-7B-alpaca-2k-test-merged is a 7 billion parameter language model derived from Meta's Llama-2-7b-hf base model. It has been fine-tuned by Tensoic using the PEFT (Parameter-Efficient Fine-Tuning) LoRA method on the henrichsen/alpaca_2k_test dataset. This fine-tuning process aims to enhance the model's ability to follow instructions and generate responses aligned with the Alpaca instruction format.
## Key Characteristics
- Base Model: Llama-2-7b-hf
- Parameter Count: 7 billion
- Fine-tuning Method: LoRA (Low-Rank Adaptation) with `lora_r: 32` and `lora_alpha: 16`
- Dataset: `henrichsen/alpaca_2k_test`
- Context Length: configured for a sequence length of 4096 tokens
- Quantization: trained with `load_in_8bit: true` using `bitsandbytes` for memory efficiency
## Training Details
The model was trained for 3 epochs on 8x NVIDIA V100 GPUs (32 GB each) with a micro batch size of 2 and a learning rate of 0.0002. Gradient accumulation steps were set to 4, and `gradient_checkpointing` was enabled. The training used `fp16` precision and `xformers_attention` for optimized performance.
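The hyperparameter names quoted in this card (`lora_r`, `load_in_8bit`, `xformers_attention`) match the config schema of the axolotl fine-tuning framework, so the run can be summarized as a config along these lines. This is a reconstruction from the values stated above, not the actual training file; any key not mentioned in the card (e.g. the dataset `type`) is an assumption:

```yaml
# Reconstructed axolotl-style config; values taken from this model card.
base_model: meta-llama/Llama-2-7b-hf
load_in_8bit: true              # bitsandbytes 8-bit base weights
sequence_len: 4096

adapter: lora
lora_r: 32
lora_alpha: 16

datasets:
  - path: henrichsen/alpaca_2k_test
    type: alpaca                # assumed: standard Alpaca prompt format

num_epochs: 3
micro_batch_size: 2             # per-GPU batch size across 8x V100 32GB
gradient_accumulation_steps: 4
learning_rate: 0.0002
fp16: true
gradient_checkpointing: true
xformers_attention: true
```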