Name: shenzhi-wang/Llama3.1-70B-Chinese-Chat API
Brand: Featherless.ai
Price: 25.00 USD
Availability: InStock
Author: shenzhi-wang

Overview

shenzhi-wang/Llama3.1-70B-Chinese-Chat is a 70 billion parameter instruction-tuned language model developed by Shenzhi Wang and Yaowei Zheng, built upon the Meta-Llama-3.1-70B-Instruct base model. It is specifically fine-tuned for both Chinese and English users, enhancing its utility in bilingual environments. The model utilizes the ORPO fine-tuning algorithm, a reference-free monolithic preference optimization method, to achieve its specialized capabilities.

Key Capabilities

Bilingual Support: Optimized for both Chinese and English users.
Enhanced Roleplay: Demonstrates significant improvements in roleplaying scenarios.
Function Calling: Features enhanced capabilities for function calling, useful for tool-use applications.
Mathematical Proficiency: Exhibits improved performance in mathematical tasks.
Context Length: Supports a context length of 32,768 tokens, inherited from its base model.

Training Details

The model was fine-tuned over 3 epochs with a learning rate of 1.5e-6 and a cosine learning rate scheduler. It used a cutoff length of 8192 and an ORPO beta of 0.05, with full parameter fine-tuning and a paged_adamw_32bit optimizer. The training dataset included over 100,000 preference pairs.

Good for

Applications requiring strong bilingual (Chinese/English) conversational abilities.
Use cases involving complex roleplaying or character interactions.
Scenarios where robust function calling and tool integration are necessary.
Tasks demanding improved mathematical reasoning and problem-solving.

Overview

Overview

Key Capabilities

Training Details

Good for

Full Model Card (README)