sail/Sailor-4B
Text Generation | Concurrency Cost: 1 | Model Size: 4B | Quant: BF16 | Ctx Length: 32k | Published: Feb 29, 2024 | License: apache-2.0 | Architecture: Transformer | Open Weights | Warm

Sailor-4B is a 4-billion-parameter causal language model developed by sail and built on the Qwen 1.5 architecture. It is tailored for South-East Asian (SEA) languages, including Indonesian, Thai, Vietnamese, Malay, and Lao, and supports a context length of 32,768 tokens. The model understands and generates text in these languages and is proficient at tasks such as question answering and commonsense reasoning.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p
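
The sampler parameters listed above are typically sent alongside the prompt in a text-generation request. The sketch below only assembles such a request body and makes no network call. The helper name `build_completion_payload`, the payload shape, and every numeric value are assumptions for illustration; the values are placeholders, not the popular settings referred to on this page.

```python
import json

# The seven sampler parameters named in the list above.
SAMPLER_KEYS = (
    "temperature",
    "top_p",
    "top_k",
    "frequency_penalty",
    "presence_penalty",
    "repetition_penalty",
    "min_p",
)


def build_completion_payload(prompt, sampler, model="sail/Sailor-4B", max_tokens=128):
    """Assemble a JSON-serializable request body.

    Hypothetical helper: the field layout is an assumed, generic
    completion-style payload, not a documented API contract.
    """
    # Reject anything that is not one of the listed sampler parameters.
    unknown = set(sampler) - set(SAMPLER_KEYS)
    if unknown:
        raise ValueError(f"unknown sampler parameters: {sorted(unknown)}")
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    payload.update(sampler)  # parameters left out fall back to server defaults
    return payload


# Placeholder values chosen only to show the shape of a configuration.
example_sampler = {
    "temperature": 0.7,
    "top_p": 0.9,
    "top_k": 40,
    "repetition_penalty": 1.1,
}

# An Indonesian prompt ("The capital of Indonesia is").
payload = build_completion_payload("Ibu kota Indonesia adalah", example_sampler)
print(json.dumps(payload, indent=2, ensure_ascii=False))
```

Validating the keys before sending catches misspelled parameter names early. Omitting a parameter, as with `min_p` here, leaves it out of the payload entirely.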