AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft
The AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft model is an 8 billion parameter language model, fine-tuned from AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en. This model is specifically instruction-tuned on the alpaca_en dataset, making it suitable for general-purpose conversational AI and instruction-following tasks. It leverages the Llama 3 architecture and has a context length of 8192 tokens.
Overview
This model, llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft, is an 8 billion parameter language model developed by AmberYifan. It is a fine-tuned variant of the AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en base model, specifically optimized through supervised fine-tuning (SFT) on the alpaca_en dataset. This instruction-tuning process enhances its ability to follow commands and generate coherent, relevant responses based on given prompts.
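For quick experimentation, the model can be loaded with the Hugging Face transformers library. The snippet below is a minimal sketch: the Alpaca-style prompt template, the bfloat16 precision, and the generation settings are assumptions based on the alpaca_en fine-tuning data, not a documented interface of this model.

```python
# Minimal inference sketch using transformers.
# The Alpaca-style prompt template is an assumption (alpaca_en fine-tuning data);
# adjust it if the model card specifies a different format.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumed precision; fits an 8B model on a single large GPU
    device_map="auto",
)

# Assumed Alpaca-style instruction prompt.
prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\nExplain what supervised fine-tuning is in one sentence.\n\n"
    "### Response:\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=False)
# Decode only the newly generated tokens.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```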
Key Training Details
- Base Model: AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en
- Fine-tuning Dataset: alpaca_en
- Learning Rate: 1e-05
- Optimizer: AdamW with cosine learning rate scheduler
- Epochs: 3.0
- Total Batch Size: 128 (across 8 GPUs with gradient accumulation; see the sketch below)
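These hyperparameters can be expressed as a transformers `TrainingArguments` configuration. The sketch below is a hedged reconstruction, not the original training script: the per-device batch size and gradient accumulation split are assumptions, since only the total batch size of 128 across 8 GPUs is reported.

```python
# Hypothetical reconstruction of the reported SFT hyperparameters.
# Only the totals above are documented; the per-device / accumulation split
# and bf16 precision are assumptions.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="llama3-8b-sft",
    learning_rate=1e-5,                 # reported learning rate
    lr_scheduler_type="cosine",         # reported cosine schedule
    optim="adamw_torch",                # reported AdamW optimizer
    num_train_epochs=3.0,               # reported epochs
    per_device_train_batch_size=2,      # assumed split
    gradient_accumulation_steps=8,      # 2 * 8 * 8 GPUs = 128 total batch size
    bf16=True,                          # assumed precision
)
```

With these arguments, the effective batch size is per-device batch size × gradient accumulation steps × number of GPUs, which matches the reported total of 128.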
Intended Use Cases
This model is primarily intended for applications requiring instruction-following capabilities in English. Its fine-tuning on the Alpaca dataset suggests suitability for:
- General-purpose chatbots
- Question answering
- Text generation based on specific instructions
- Prototyping and development of conversational AI systems