haoranxu/ALMA-13B
Text Generation | Concurrency Cost: 1 | Model Size: 13B | Quant: FP8 | Ctx Length: 4k | Published: Sep 17, 2023 | License: MIT | Architecture: Transformer | Open Weights | Cold

ALMA-13B is a 13-billion-parameter language model developed by Haoran Xu and collaborators, based on the LLaMA-2 architecture. It is designed specifically for machine translation and is trained with a two-step fine-tuning process: first on monolingual data, then on high-quality parallel data. Its intended use is translating text from one language into another.
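Because the model is tuned for translation, it is normally driven with a prompt that names the source and target languages. The sketch below shows one way this might look with the Hugging Face `transformers` library. The prompt template, the helper names `build_prompt` and `translate`, and the generation settings are assumptions for illustration, not the authors' confirmed setup, so check them against the official model card before relying on them.

```python
def build_prompt(src_lang: str, tgt_lang: str, text: str) -> str:
    """Build a translation prompt.

    The template is an assumption for illustration; verify it against
    the official model card before use.
    """
    return (
        f"Translate this from {src_lang} to {tgt_lang}:\n"
        f"{src_lang}: {text}\n"
        f"{tgt_lang}:"
    )


def translate(text: str, src_lang: str, tgt_lang: str,
              model_id: str = "haoranxu/ALMA-13B") -> str:
    """Hypothetical helper: load the model and translate one sentence."""
    # Imported lazily so build_prompt works without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # device_map="auto" needs the `accelerate` package installed.
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.float16, device_map="auto"
    )

    prompt = build_prompt(src_lang, tgt_lang, text)
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

    with torch.no_grad():
        output_ids = model.generate(
            **inputs,
            max_new_tokens=256,  # assumed limit; prompt plus output must fit the 4k context
            num_beams=5,         # assumed setting; beam search is a common choice for translation
            do_sample=False,
        )

    # Decode only the newly generated tokens, not the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

Calling `translate("Guten Morgen.", "German", "English")` would download the weights and return the model's continuation after the final `English:` line. Ending the prompt on the target-language label is what cues the model to produce only the translation.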
