Bllossom/llama-3-Korean-Bllossom-70B

Hugging Face
TEXT GENERATIONConcurrency Cost:4Model Size:70BQuant:FP8Ctx Length:8kPublished:May 8, 2024License:llama3Architecture:Transformer0.1K Warm

Bllossom/llama-3-Korean-Bllossom-70B is a 70.8 billion parameter Korean-English bilingual language model developed by MLPLab at Seoultech, Teddysum, and Yonsei University, based on the Llama-3 architecture. It features significant Korean vocabulary expansion (over 30,000 words) and enhanced Korean context processing, allowing for approximately 25% longer Korean context length compared to Llama-3. The model is fine-tuned with custom data considering Korean culture and language, and incorporates knowledge linking through parallel Korean-English corpora, making it highly effective for Korean-centric bilingual applications.

Loading preview...

Bllossom/llama-3-Korean-Bllossom-70B Overview

Bllossom/llama-3-Korean-Bllossom-70B is a 70.8 billion parameter Korean-English bilingual language model built upon the Llama-3 architecture. Developed through a collaboration between MLPLab at Seoultech, Teddysum, and Yonsei University, this model is specifically designed to enhance Korean language processing capabilities while maintaining strong English proficiency.

Key Capabilities

  • Extensive Korean Vocabulary Expansion: Features over 30,000 expanded Korean vocabulary words, significantly improving Korean expressiveness.
  • Enhanced Korean Context Handling: Processes approximately 25% longer Korean context lengths compared to the base Llama-3 model.
  • Bilingual Knowledge Linking: Utilizes Korean-English parallel corpora for pre-training to establish robust knowledge connections between the two languages.
  • Culturally Aligned Instruction Tuning: Fine-tuned with custom instruction-following data tailored to Korean language nuances and cultural contexts, developed by linguists.
  • Reinforcement Learning (DPO): Incorporates Human Feedback (DPO) for improved model performance and alignment.
  • Vision-Language Alignment: The Bllossom project also includes a vision-language model (Bllossom-V) that aligns vision transformers with this language model.

Ideal Use Cases

  • Applications requiring high-quality Korean language generation and understanding.
  • Bilingual Korean-English tasks, leveraging its knowledge linking capabilities.
  • Developing custom models that benefit from a strong Korean linguistic foundation and commercial use is permitted.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p