Model Overview
This model, TheBloke/gpt4-alpaca-lora-13B-HF, is a 13-billion-parameter LLaMA-based language model. It is chansung's gpt4-alpaca-lora-13b LoRA adapter merged into the base LLaMA weights, fine-tuned to operate as a chatbot.
Key Capabilities
- Chatbot Functionality: The model was fine-tuned via a LoRA checkpoint for instruction-following and conversational tasks, making it well suited to chatbot applications.
- LLaMA Architecture: Built upon the LLaMA base model, it inherits the foundational capabilities of that architecture.
- Instruction Following: The fine-tuning process used an instruction-following dataset (`alpaca_data_gpt4.json`) to enhance its ability to respond effectively to user prompts.
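Because the model was tuned on Alpaca-style instruction data, prompts are typically wrapped in the Alpaca template before generation. A minimal sketch, assuming the model follows the standard template from its Alpaca lineage (`build_prompt` is an illustrative helper, not part of any library):

```python
# Standard Alpaca instruction template; assumed to match this model's
# training format based on its Alpaca-LoRA lineage.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)

def build_prompt(instruction: str) -> str:
    """Wrap a raw user instruction in the Alpaca prompt format."""
    return ALPACA_TEMPLATE.format(instruction=instruction)

prompt = build_prompt("Summarize the benefits of instruction tuning.")
print(prompt)
```

The formatted string is then passed to the tokenizer and the model's generate call; text after `### Response:` is the model's answer.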
Training Details
The original LoRA checkpoint was trained with a script adapted from the official Alpaca-LoRA implementation. Training ran for 10 epochs with a cutoff length of 512 tokens. Further details are available in the accompanying Weights & Biases report.
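A hedged sketch of what that fine-tuning command may have looked like, using the flags exposed by Alpaca-LoRA's `finetune.py` (the base-model path and output directory are placeholders, not values from the source):

```shell
# Assumed Alpaca-LoRA invocation; only epochs, cutoff length, and the
# dataset name are stated in this card -- paths are placeholders.
python finetune.py \
    --base_model <path-to-llama-13b-weights> \
    --data_path alpaca_data_gpt4.json \
    --output_dir ./gpt4-alpaca-lora-13b \
    --num_epochs 10 \
    --cutoff_len 512
```

The resulting LoRA adapter was later merged into the base weights to produce this HF-format repository.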
Good For
- Developing conversational AI agents.
- Applications requiring instruction-tuned language model responses.
- Experimenting with LLaMA-based chatbot models.