haoranxu/X-ALMA-13B-Group3
TEXT GENERATIONConcurrency Cost:1Model Size:13BQuant:FP8Ctx Length:4kPublished:Aug 23, 2024License:mitArchitecture:Transformer0.0K Open Weights Cold

haoranxu/X-ALMA-13B-Group3 is a 13 billion parameter multilingual language model developed by Haoran Xu et al., building on the ALMA-R architecture. It features a plug-and-play design with language-specific modules, supporting 50 languages, with this specific release optimized for English, Bulgarian, Macedonian, Serbian, Ukrainian, and Russian. The model is designed for quality translation at scale and multilingual open-ended QA, utilizing a carefully designed training recipe and a 4096-token context length.

Loading preview...