shenzhi-wang/Llama3.1-8B-Chinese-Chat

Text Generation · Concurrency Cost: 1 · Model Size: 8B · Quant: FP8 · Ctx Length: 32K · Published: Jul 24, 2024 · License: llama3.1 · Architecture: Transformer

shenzhi-wang/Llama3.1-8B-Chinese-Chat is an 8 billion parameter instruction-tuned language model developed by Shenzhi Wang and Yaowei Zheng, built upon Meta-Llama-3.1-8B-Instruct. It is fine-tuned specifically for Chinese and English users, with enhanced roleplay, function-calling, and mathematical capabilities. The model supports a context length of up to 128K tokens (served here with a 32K window) and is suited to a wide range of conversational applications.


Model Overview

shenzhi-wang/Llama3.1-8B-Chinese-Chat is an 8 billion parameter instruction-tuned language model, developed by Shenzhi Wang and Yaowei Zheng, designed for both Chinese and English users. It is built upon the Meta-Llama-3.1-8B-Instruct base model and fine-tuned with the ORPO algorithm on a dataset of over 100,000 preference pairs, yielding significant gains in roleplay, function calling, and mathematics over the base model.
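For quick experimentation, the model can be loaded with the standard Hugging Face Transformers chat workflow. The sketch below is minimal and illustrative; the dtype, device placement, and generation length are assumptions, not recommendations from the model card.

```python
# Minimal sketch: chat inference with Transformers.
# dtype, device_map, and max_new_tokens are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "shenzhi-wang/Llama3.1-8B-Chinese-Chat"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [{"role": "user", "content": "你好，请用中文介绍一下你自己。"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(input_ids, max_new_tokens=256)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```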

Key Capabilities

  • Enhanced Roleplay: Demonstrates improved performance in role-playing scenarios.
  • Function Calling: Exhibits strong capabilities in function calling tasks (see the hedged sketch after this list).
  • Mathematical Reasoning: Shows significant improvements in handling mathematical problems.
  • Multilingual Support: Optimized for both Chinese and English language users.
  • Extended Context: Inherits the 128K token context length from its base model, though this capability has not been specifically tested on the Chinese chat fine-tune.
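To illustrate the function-calling capability, the sketch below builds a tool-use prompt via the `tools` argument of Transformers' `apply_chat_template` (available since v4.42). It assumes this fine-tune preserves the base Llama 3.1 tool-call format, which the model card does not state explicitly; `get_weather` is a hypothetical tool.

```python
# Hedged sketch: constructing a function-calling prompt with the chat
# template's `tools` support. get_weather is a hypothetical example tool;
# whether this fine-tune keeps the base model's tool format is an assumption.
import json
from transformers import AutoTokenizer

def get_weather(city: str) -> str:
    """Get the current weather for a city.

    Args:
        city: Name of the city to query.
    """
    return json.dumps({"city": city, "temp_c": 22})

tokenizer = AutoTokenizer.from_pretrained("shenzhi-wang/Llama3.1-8B-Chinese-Chat")
messages = [{"role": "user", "content": "What's the weather in Shenzhen right now?"}]
prompt = tokenizer.apply_chat_template(
    messages, tools=[get_weather], add_generation_prompt=True, tokenize=False
)
print(prompt)  # Inspect how the tool schema is injected before generating
```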

Training Details

The model was fine-tuned using the LLaMA-Factory framework over 3 epochs with full parameter tuning. It employs a cosine learning rate scheduler with a warmup ratio of 0.1 and an ORPO beta of 0.05. The cutoff length for training was 8192 tokens, with a global batch size of 128.
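For reference, the reported hyperparameters can be gathered into a single configuration sketch. The key names below loosely mirror LLaMA-Factory options but are assumptions; only the values come from the training description above.

```python
# Reported ORPO fine-tuning setup. Key names are assumptions loosely
# mirroring LLaMA-Factory options; values are from the model card.
orpo_config = {
    "finetuning_type": "full",      # full parameter tuning
    "stage": "orpo",                # ORPO preference optimization
    "orpo_beta": 0.05,
    "num_train_epochs": 3,
    "lr_scheduler_type": "cosine",
    "warmup_ratio": 0.1,
    "cutoff_len": 8192,             # training cutoff length in tokens
    "global_batch_size": 128,       # per-device batch x grad accumulation x GPUs
}
```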

Good for

This model is particularly well-suited for applications requiring advanced conversational abilities, especially those involving role-playing, precise function execution, and mathematical problem-solving in both Chinese and English contexts. Its availability in official q4_k_m, q8_0, and f16 GGUF versions also makes it suitable for local deployment and inference using tools like LM Studio or llama.cpp.
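For local inference against one of the GGUF builds, llama-cpp-python offers a compact path. The repo id and filename glob below are assumptions; check the model page for the official GGUF file locations.

```python
# Hedged sketch: local inference with llama-cpp-python on a q4_k_m GGUF.
# The filename glob is an assumption; verify it against the actual repo.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="shenzhi-wang/Llama3.1-8B-Chinese-Chat",
    filename="*q4_k_m*.gguf",  # glob matched against files in the repo
    n_ctx=8192,                # context window for this session
)
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "用中文写一首关于秋天的短诗。"}]
)
print(out["choices"][0]["message"]["content"])
```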

Popular Sampler Settings

Featherless tracks the top three sampler configurations its users run with this model, across the following parameters:

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p
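These parameters map directly onto an OpenAI-compatible chat completion request. The sketch below shows where each one goes; the base URL is an assumed Featherless endpoint, and the values are placeholders rather than the actual top user configurations.

```python
# Hedged sketch: passing these sampler settings to an OpenAI-compatible
# endpoint. The base URL is assumed and the values are placeholders, not
# the actual top user configurations.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint
    api_key="YOUR_API_KEY",
)
resp = client.chat.completions.create(
    model="shenzhi-wang/Llama3.1-8B-Chinese-Chat",
    messages=[{"role": "user", "content": "你好！"}],
    temperature=0.7,           # placeholder values throughout
    top_p=0.9,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Non-standard sampler knobs go through extra_body on this client.
    extra_body={"top_k": 40, "repetition_penalty": 1.05, "min_p": 0.05},
)
print(resp.choices[0].message.content)
```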