jiwon9703/KoLlama-3.1-8B-Instruct-qlora-sft-DDP-v0
KoLlama-3.1-8B-Instruct-qlora-sft-DDP-v0 is an 8 billion parameter instruction-tuned language model developed by jiwon9703, based on Meta's Llama-3.1-8B-Instruct architecture. The model was fine-tuned with QLoRA using Distributed Data Parallel (DDP) training, and leverages 4-bit quantization and LoRA adapters for efficient deployment and inference. It specializes in Korean and English language tasks, making it particularly suitable for applications requiring strong bilingual capabilities.
Key Features & Training Details
- Base Model: Llama-3.1-8B-Instruct, providing a strong foundation for general language understanding.
- Fine-tuning Method: Utilizes QLoRA (Quantized Low-Rank Adaptation) with 4-bit NF4 quantization and double quantization enabled, making it efficient for deployment.
- Distributed Training: Trained with Distributed Data Parallel (DDP) across 4 NVIDIA GPUs for higher training throughput.
- Language Focus: Primarily instruction-tuned on a dedicated Korean SFT dataset (`ko_combined_sft_dataset.json`), enhancing its Korean language generation and comprehension while retaining English support.
- LoRA Configuration: Employs a LoRA rank (r) of 8 and alpha of 16, targeting the key attention and feed-forward projection modules (`q_proj`, `k_proj`, `v_proj`, `o_proj`, `up_proj`, `down_proj`, `gate_proj`).
- Context Length: Supports a maximum sequence length of 4096 tokens during training.
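The quantization and LoRA settings above can be sketched as configuration objects for the `transformers` and `peft` libraries. This is a hedged reconstruction: only the NF4/double-quantization flags, rank, alpha, and target modules come from the card; the compute dtype, dropout, and task type are assumptions.

```python
import torch
from transformers import BitsAndBytesConfig
from peft import LoraConfig

# 4-bit NF4 quantization with double quantization, as stated in the card.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,  # assumed compute dtype
)

# LoRA rank 8, alpha 16, targeting the listed attention/feed-forward modules.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=[
        "q_proj", "k_proj", "v_proj", "o_proj",
        "up_proj", "down_proj", "gate_proj",
    ],
    lora_dropout=0.05,      # assumed; not stated in the card
    task_type="CAUSAL_LM",  # assumed for a decoder-only LM
)
```

These objects would typically be passed to `AutoModelForCausalLM.from_pretrained(..., quantization_config=bnb_config)` and `peft.get_peft_model(model, lora_config)` before launching DDP training.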
Intended Use Cases
This model is well-suited for applications requiring instruction-following capabilities in both Korean and English. Its fine-tuning on Korean datasets makes it a strong candidate for:
- Korean language generation and understanding.
- Bilingual chatbots and conversational AI systems.
- Text summarization and translation tasks involving Korean.
- General instruction-following tasks in both languages.
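For the use cases above, a minimal inference sketch with the Transformers `pipeline` API might look like the following. The repository ID matches this card; the generation parameters and the Korean prompt are illustrative assumptions.

```python
import torch
from transformers import pipeline

# Load the model for chat-style text generation (requires sufficient GPU memory).
pipe = pipeline(
    "text-generation",
    model="jiwon9703/KoLlama-3.1-8B-Instruct-qlora-sft-DDP-v0",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Example Korean instruction: "Describe the capital of Korea in one sentence."
messages = [
    {"role": "user", "content": "한국의 수도에 대해 한 문장으로 설명해 주세요."},
]

out = pipe(messages, max_new_tokens=128)
print(out[0]["generated_text"][-1]["content"])
```

Since the model is QLoRA fine-tuned, it can also be loaded in 4-bit with a `BitsAndBytesConfig` to reduce memory use at some cost in latency.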
Limitations
As with its base model, KoLlama-3.1-8B-Instruct-qlora-sft-DDP-v0 may exhibit limitations such as generating incorrect or biased information. It is not recommended for critical applications without human oversight.