p208p2002/llama-3-zhtw-8B
Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 8k | License: llama3 | Architecture: Transformer

p208p2002/llama-3-zhtw-8B is an 8-billion-parameter language model based on Llama 3, developed by p208p2002 and fine-tuned with 800M additional tokens to add Traditional Chinese (zhtw) capability. Its continued pre-training data includes FineWeb, which lets it retain the original Llama 3's English MMLU performance, together with Chinese and code datasets. The model is intended for applications that need strong English understanding alongside Traditional Chinese processing.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p
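
The sampler parameters listed above are typically sent as fields of a text-generation request. The following is a minimal sketch of assembling such a request body in Python. The numeric values are placeholders chosen for illustration, not the settings reported on this page, and the helper name `build_request` is hypothetical. It also assumes the serving endpoint accepts all seven fields: `top_k`, `repetition_penalty`, and `min_p` are not part of the standard OpenAI request schema, so support for them should be checked against the provider's documentation.

```python
import json

MODEL_ID = "p208p2002/llama-3-zhtw-8B"

# Placeholder values for illustration only; they are NOT the
# "popular" settings from this page, which are not reproduced here.
SAMPLER_DEFAULTS = {
    "temperature": 0.7,
    "top_p": 0.9,
    "top_k": 40,
    "frequency_penalty": 0.0,
    "presence_penalty": 0.0,
    "repetition_penalty": 1.1,
    "min_p": 0.05,
}


def build_request(prompt, max_tokens=256, **overrides):
    """Return a completion-style request body (hypothetical helper).

    Keyword overrides replace the placeholder sampler values; any name
    that is not one of the seven sampler parameters is rejected.
    """
    unknown = set(overrides) - set(SAMPLER_DEFAULTS)
    if unknown:
        raise ValueError(f"unknown sampler parameters: {sorted(unknown)}")
    payload = {"model": MODEL_ID, "prompt": prompt, "max_tokens": max_tokens}
    payload.update(SAMPLER_DEFAULTS)
    payload.update(overrides)
    return payload


if __name__ == "__main__":
    # Build a request with a lower temperature and print it as JSON.
    body = build_request("台灣最高的山是", temperature=0.2)
    print(json.dumps(body, ensure_ascii=False, indent=2))
```

Keeping the sampler values in one dictionary makes it easy to swap in a different combination without touching the rest of the request.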