atsuki-yamaguchi/Qwen2.5-7B-Instruct-am-madlad-mean-tuned
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Nov 22, 2024License:apache-2.0Architecture:Transformer Open Weights Cold
The atsuki-yamaguchi/Qwen2.5-7B-Instruct-am-madlad-mean-tuned model is a 7.6 billion parameter instruction-tuned language model based on Qwen2.5-7B-Instruct, specifically adapted for Amharic. It features an expanded vocabulary of 10,000 additional target language tokens, initialized using mean initialization. This model was continually pre-trained on 500 million Amharic tokens sampled from the MADLAD-400 dataset, making it specialized for Amharic language processing tasks.
Loading preview...