Radu1999/Mistral-Instruct-Ukrainian-SFT is a 7 billion parameter instruction-tuned language model developed by Radu Chivereanu, based on the Mistral-7B-v0.2 architecture. This model is specifically fine-tuned on various Ukrainian datasets, including UA-SQUAD and Ukrainian StackExchange, to enhance its performance in the Ukrainian language. It utilizes Grouped-Query Attention, Sliding-Window Attention, and a Byte-fallback BPE tokenizer. The model is optimized for instruction-following tasks in Ukrainian, making it suitable for applications requiring natural language understanding and generation in this specific language.
Loading preview...
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.