NorHsangPha/merge_llama3_adapter_Shan

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Jul 8, 2024License:llama3Architecture:Transformer Warm

NorHsangPha/merge_llama3_adapter_Shan is an 8 billion parameter Llama 3 model, developed by Meta and further fine-tuned by NorHsangPha for the Shan language. This instruction-tuned model leverages an optimized transformer architecture and Grouped-Query Attention (GQA) for improved inference scalability. It is primarily designed for dialogue use cases, excelling in assistant-like chat, and has been adapted for Shan language tasks through fine-tuning on the oasst1_shan_translation dataset.

Loading preview...

Model Overview

NorHsangPha/merge_llama3_adapter_Shan is an 8 billion parameter Llama 3 model, originally developed by Meta and subsequently fine-tuned by NorHsangPha. This model is a merge that incorporates an adapter specifically trained for the Shan language, utilizing the NorHsangPha/llama3_adapter_shan and the oasst1_shan_translation dataset.

Key Capabilities

  • Shan Language Adaptation: Specifically fine-tuned to support and generate text in the Shan language, making it suitable for applications requiring this linguistic capability.
  • Llama 3 Architecture: Built upon Meta's Llama 3 family, featuring an optimized transformer architecture and Grouped-Query Attention (GQA) for efficient inference.
  • Instruction-Tuned: Optimized for dialogue and assistant-like chat use cases, outperforming many open-source chat models on common benchmarks.
  • Context Length: Supports an 8k token context length, allowing for processing and generating longer sequences of text.

Intended Use Cases

  • Shan Language Applications: Ideal for research and commercial use cases involving the Shan language, such as translation, content generation, or conversational AI.
  • Dialogue Systems: Well-suited for building chatbots and virtual assistants that require robust conversational abilities.
  • Natural Language Generation: Can be adapted for various natural language generation tasks, especially where Shan language support is critical.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p