p208p2002/llama-3-zhtw-8B
Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 8k | License: llama3 | Architecture: Transformer
p208p2002/llama-3-zhtw-8B is an 8-billion-parameter language model based on Llama 3, developed by p208p2002 and trained on an additional 800M tokens to add Traditional Chinese (zhtw) capability. The continued pre-training data includes FineWeb alongside Chinese and code datasets, which lets the model retain the original Llama 3's English MMLU performance. It is intended for applications that need strong English understanding together with Traditional Chinese processing.
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Each configuration sets the following parameters: temperature, top_p, top_k, frequency_penalty, presence_penalty, repetition_penalty, min_p.
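As a rough illustration of how the sampler parameters named above might be combined into one configuration, here is a minimal Python sketch that builds a request body. The numeric values are placeholders chosen for illustration, not the settings Featherless users actually picked. The payload shape (an OpenAI-style completion body with extra sampler fields) and the helper name `build_completion_payload` are also assumptions, so check the provider's API documentation before relying on any field.

```python
import json

MODEL_ID = "p208p2002/llama-3-zhtw-8B"

# Placeholder values for illustration only; these are NOT the
# popular settings from the page, which does not list the numbers.
EXAMPLE_SAMPLER = {
    "temperature": 0.7,
    "top_p": 0.9,
    "top_k": 40,
    "frequency_penalty": 0.0,
    "presence_penalty": 0.0,
    "repetition_penalty": 1.1,
    "min_p": 0.05,
}


def build_completion_payload(prompt, sampler=EXAMPLE_SAMPLER, max_tokens=256):
    """Return a JSON-serializable request body (hypothetical shape)
    that merges the prompt with the sampler settings."""
    payload = {"model": MODEL_ID, "prompt": prompt, "max_tokens": max_tokens}
    payload.update(sampler)  # copy each sampler field into the body
    return payload


if __name__ == "__main__":
    body = build_completion_payload("請用繁體中文介紹台北。")
    print(json.dumps(body, ensure_ascii=False, indent=2))
```

The sketch only assembles the request body and does not send it, so it can be adapted to whichever client library or endpoint is in use.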