chujiezheng/Llama3-8B-Chinese-Chat-ExPO
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:May 25, 2024License:llama3Architecture:Transformer0.0K Warm

The Llama3-8B-Chinese-Chat-ExPO model, developed by chujiezheng, is an 8 billion parameter language model with an 8192 token context length. It is an extrapolated (ExPO) version of the Llama3-8B-Chinese-Chat and Meta-Llama-3-8B-Instruct models, applying a novel extrapolation technique (alpha = 0.3) to enhance alignment with human preferences. This experimental model aims to improve performance in Chinese language tasks through its unique extrapolation method, showing win rate improvements on benchmarks like AlpacaEval 2.0 and MT-Bench.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p