pkupie/Qwen2.5-3B-bo-cpt

Text generation · Model size: 3.1B · Quant: BF16 · Context length: 32k · Published: Apr 28, 2026 · License: apache-2.0 · Architecture: Transformer · Open weights

The pkupie/Qwen2.5-3B-bo-cpt model is a 3.1 billion parameter language model continually pretrained from Qwen2.5-3B on the Tibetan portion of the MC^2 Corpus. The checkpoint is optimized for Tibetan language modeling and is intended primarily for research on low-resource language adaptation, in particular as a base for model merging and logit fusion experiments.


Qwen2.5-3B Continually Pretrained on Tibetan

This model, pkupie/Qwen2.5-3B-bo-cpt, is a 3.1 billion parameter language model produced by continual pretraining (CPT). Starting from the Qwen2.5-3B base model, it was further trained on the Tibetan subset of the MC^2 Corpus.
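The checkpoint can be loaded like any other causal language model. The snippet below is a minimal sketch, assuming the standard Hugging Face transformers AutoModelForCausalLM / AutoTokenizer APIs and the BF16 weights listed above; the Tibetan prompt is only a placeholder.

```python
# Minimal usage sketch. Assumes standard transformers loading; the prompt and
# generation settings are placeholders, not recommendations from the model card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "pkupie/Qwen2.5-3B-bo-cpt"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # card lists BF16 weights
    device_map="auto",
)

# Base (non-chat) model: plain next-token continuation of a Tibetan prompt.
prompt = "བོད་"  # placeholder Tibetan prompt ("Tibet")
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```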

Key Capabilities

  • Enhanced Tibetan Language Modeling: Optimized to improve performance and understanding of the Tibetan language.
  • Low-Resource Language Adaptation: Serves as a specialized checkpoint for research into adapting large language models to languages with limited data.

Good for

  • Research Purposes: Primarily intended for academic and research use, particularly in the field of low-resource NLP.
  • Base Model for Further Work: Suitable as a foundation for experiments involving model merging and logit fusion, as described in the associated research paper "Efficient Low-Resource Language Adaptation via Multi-Source Dynamic Logit Fusion" (a simplified logit-fusion sketch follows this list).
  • Tibetan Language Applications: Potentially useful for developing applications requiring strong Tibetan language capabilities.
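As a rough illustration of the kind of logit fusion this checkpoint is meant to support, the sketch below interpolates next-token logits from a Qwen2.5-3B base model and this CPT checkpoint with a fixed weight. This is not the multi-source dynamic fusion method from the paper: the base model ID, the 0.5 mixing coefficient, and greedy decoding are placeholder assumptions, and it presumes both checkpoints share the same tokenizer and vocabulary.

```python
# Illustrative fixed-weight logit interpolation between the base model and this
# CPT checkpoint. NOT the paper's dynamic fusion method; all settings below are
# placeholder assumptions for demonstration only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "Qwen/Qwen2.5-3B"              # assumed base checkpoint
cpt_id = "pkupie/Qwen2.5-3B-bo-cpt"

# Assumes the CPT model kept the Qwen2.5-3B tokenizer/vocabulary unchanged.
tokenizer = AutoTokenizer.from_pretrained(cpt_id)
base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)
cpt = AutoModelForCausalLM.from_pretrained(cpt_id, torch_dtype=torch.bfloat16)
base.eval()
cpt.eval()

alpha = 0.5  # static mixing weight (placeholder; the paper computes this dynamically)
prompt = "བོད་"  # placeholder Tibetan prompt
input_ids = tokenizer(prompt, return_tensors="pt").input_ids

with torch.no_grad():
    for _ in range(32):
        logits_base = base(input_ids).logits[:, -1, :]
        logits_cpt = cpt(input_ids).logits[:, -1, :]
        fused = alpha * logits_cpt + (1 - alpha) * logits_base  # combine next-token logits
        next_id = fused.argmax(dim=-1, keepdim=True)            # greedy decoding
        input_ids = torch.cat([input_ids, next_id], dim=-1)

print(tokenizer.decode(input_ids[0], skip_special_tokens=True))
```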