haqishen/Llama-3-8B-Japanese-Instruct

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Apr 23, 2024License:llama3Architecture:Transformer0.0K Warm

haqishen/Llama-3-8B-Japanese-Instruct is an 8 billion parameter Llama-3-8B-Instruct model fine-tuned by Qishen Ha specifically for Japanese conversational tasks. Utilizing the fujiki/japanese_hh-rlhf-49k dataset and LLaMA-Factory, this model excels at generating human-like responses in Japanese, supporting a context length of 8192 tokens. It is optimized for Japanese language understanding and generation, making it suitable for various Japanese-centric NLP applications.

Loading preview...

Model Overview

This model, haqishen/Llama-3-8B-Japanese-Instruct, is an 8 billion parameter variant of meta-llama/Meta-Llama-3-8B-Instruct that has been specifically fine-tuned for Japanese conversation. Developed by Qishen Ha, it leverages the fujiki/japanese_hh-rlhf-49k dataset, which is a Japanese human-preference dataset, to enhance its performance in Japanese language understanding and generation.

Key Capabilities

  • Japanese Conversational AI: Optimized for generating natural and contextually relevant responses in Japanese.
  • Instruction Following: Capable of following instructions and engaging in diverse conversational scenarios in Japanese.
  • Extended Context Length: Supports a maximum context length of 8192 tokens, allowing for more extensive and coherent dialogues.
  • Llama-3 Architecture: Built upon the robust Meta-Llama-3-8B-Instruct base model, ensuring strong foundational language capabilities.

Good For

  • Japanese Chatbots: Ideal for developing conversational agents that interact in Japanese.
  • Japanese Content Generation: Suitable for tasks requiring the generation of Japanese text, such as creative writing, summarization, or question answering.
  • Research and Development: Provides a strong baseline for further research into Japanese large language models and fine-tuning techniques.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p