hfl/llama-3-chinese-8b-instruct-v3

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:May 28, 2024License:apache-2.0Architecture:Transformer0.1K Open Weights Warm

The hfl/llama-3-chinese-8b-instruct-v3 is an 8 billion parameter instruction-tuned causal language model developed by hfl, further fine-tuned from a mix of Llama-3-Chinese-8B-Instruct, Llama-3-Chinese-8B-Instruct-v2, and Meta-Llama-3-8B-Instruct. This model is designed for conversational AI, question answering, and general instruction-following tasks, leveraging its 8192-token context length for robust Chinese language processing. It specializes in chat-based interactions, providing a capable foundation for applications requiring detailed responses in Chinese.

Loading preview...

Llama-3-Chinese-8B-Instruct-v3 Overview

The hfl/llama-3-chinese-8b-instruct-v3 is an 8 billion parameter instruction-tuned language model, building upon the foundation of Meta's Llama-3-8B-Instruct. Developed by hfl, this version is a further fine-tuned iteration, incorporating enhancements from previous Chinese-specific Llama-3 instruction models (hfl/Llama-3-Chinese-8B-Instruct and hfl/Llama-3-Chinese-8B-Instruct-v2). It is specifically designed as a chat model, optimized for interactive conversations and instruction-following in Chinese.

Key Capabilities

  • Instruction Following: Excels at understanding and executing a wide range of user instructions.
  • Conversational AI: Optimized for multi-turn dialogues and chat-based applications.
  • Question Answering: Capable of providing detailed and relevant answers to queries.
  • Chinese Language Processing: Enhanced for performance in Chinese language contexts, leveraging its fine-tuning history.
  • Context Length: Supports an 8192-token context window, allowing for more extensive conversations and complex prompts.

Good For

  • Developing Chinese-speaking chatbots and virtual assistants.
  • Applications requiring instruction-based text generation in Chinese.
  • Research and development in Chinese natural language understanding and generation.

Users should adhere to the Llama-3 open-source license agreement.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p