shleeeee/mistral-ko-7b-wiki-neft

Text generation · Concurrency cost: 1 · Model size: 7B · Quantization: FP8 · Context length: 8k · Architecture: Transformer

The shleeeee/mistral-ko-7b-wiki-neft model is a fine-tuned version of Mistral-7B-v0.1, developed by shleeeee (Seunghyeon Lee) and oopsung (Sungwoo Park). This 7-billion-parameter model is optimized for Korean-language tasks, leveraging a custom Korean dataset and NEFT (Noisy Embedding Fine-Tuning). It is designed for general-purpose text generation and understanding in Korean.

Overview

shleeeee/mistral-ko-7b-wiki-neft is a 7-billion-parameter language model fine-tuned from Mistral-7B-v0.1. Developed by shleeeee (Seunghyeon Lee) and oopsung (Sungwoo Park), it is specifically enhanced for the Korean language.

Key Characteristics

  • Base Model: Mistral-7B-v0.1.
  • Language Focus: Optimized for Korean using a custom Korean dataset.
  • Fine-tuning Method: Incorporates NEFT (Noisy Embedding Fine-Tuning) with a neftune_noise_alpha of 5.
  • LoRA Target Modules: Fine-tuned using LoRA on q_proj, k_proj, v_proj, o_proj, and gate_proj modules.
  • Training Details: Trained for 1,000 steps with a train batch size of 4 (a configuration sketch follows this list).
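
For orientation, here is a minimal sketch of how this setup could be reproduced with Hugging Face PEFT and TRL. The dataset file, LoRA rank, and output directory are assumptions not stated on this card; only the base model, target modules, neftune_noise_alpha, step count, and batch size come from the details above.

```python
# Hedged sketch of the fine-tuning setup described above, using PEFT + TRL.
from datasets import load_dataset
from peft import LoraConfig
from trl import SFTConfig, SFTTrainer

# Hypothetical instruction dataset built from Korean Wikipedia.
dataset = load_dataset("json", data_files="korean_wiki_instructions.json")

peft_config = LoraConfig(
    task_type="CAUSAL_LM",
    r=16,  # assumed rank; not stated on this card
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj", "gate_proj"],
)

training_args = SFTConfig(
    output_dir="mistral-ko-7b-wiki-neft",
    max_steps=1000,                 # from this card
    per_device_train_batch_size=4,  # from this card
    neftune_noise_alpha=5,          # NEFT: adds noise to token embeddings during training
)

trainer = SFTTrainer(
    model="mistralai/Mistral-7B-v0.1",  # base model
    args=training_args,
    train_dataset=dataset["train"],
    peft_config=peft_config,
)
trainer.train()
```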

Usage and Evaluation

The model follows the Mistral instruction template: <s>[INST]{instruction}[/INST]{output}</s>. While specific benchmark scores are not detailed, the original model card includes an evaluation image documenting its performance assessment. The model is suitable for a range of Korean natural language processing tasks, including text generation and comprehension.
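
A minimal inference sketch with transformers, assuming the template above; the Korean example instruction and generation settings are illustrative, not taken from this page.

```python
# Minimal inference sketch for this model using transformers.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "shleeeee/mistral-ko-7b-wiki-neft"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

instruction = "한국의 수도에 대해 설명해 주세요."  # "Please describe the capital of Korea."
prompt = f"<s>[INST]{instruction}[/INST]"  # template from the card; BOS written manually

# add_special_tokens=False so the tokenizer does not prepend a second <s>.
inputs = tokenizer(prompt, return_tensors="pt", add_special_tokens=False).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```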

Popular Sampler Settings

The three parameter combinations most used by Featherless users for this model cover the following sampler settings: temperature, top_p, top_k, frequency_penalty, presence_penalty, repetition_penalty, and min_p.
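
As an illustration, these sampler parameters map onto an OpenAI-compatible request. The endpoint URL and every value below are assumptions; only the parameter names come from this page.

```python
# Hedged sketch: sending the sampler parameters above through an
# OpenAI-compatible client.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="shleeeee/mistral-ko-7b-wiki-neft",
    messages=[{"role": "user", "content": "한국 전통 음식에 대해 알려주세요."}],
    temperature=0.7,          # illustrative values, not the page's
    top_p=0.9,                # actual top-3 configurations
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Parameters outside the OpenAI schema go in extra_body.
    extra_body={"top_k": 40, "min_p": 0.05, "repetition_penalty": 1.1},
)
print(response.choices[0].message.content)
```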