Name: janakhpon/mon-lm-qwen2.5-1.5b API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: janakhpon

Mon-LM (Qwen2.5-1.5B) Overview

janakhpon/mon-lm-qwen2.5-1.5b is a specialized Large Language Model (LLM) with 1.5 billion parameters, built upon the robust Qwen2.5 architecture. Its primary distinction lies in its dedicated focus on the Mon language (mnw).

Key Capabilities & Features

Mon Language Specialization: The model has undergone Continual Pre-Training (CPT) using QLoRA on an extensive Mon language corpus, making it highly proficient in Mon.
Expanded Tokenizer: The base Qwen2.5 tokenizer has been significantly expanded to include approximately 3,000 Mon-specific tokens (SentencePiece Unigram), optimizing its understanding and generation of Mon text. This expansion involved injecting Mon subwords into the embedding layer to improve compression ratio and linguistic atomicity.
NFC Normalization: All Mon text processed during training was NFC normalized, ensuring consistent character representation.
Context Length: It supports a substantial context length of 32768 tokens, allowing for processing longer Mon texts.

Use Cases

This model is particularly well-suited for applications requiring deep understanding and generation of the Mon language. Potential use cases include:

Mon language translation systems.
Content generation in Mon.
Mon language research and linguistic analysis.
Educational tools for Mon speakers or learners.

Developed as part of the Mon Language AI initiative, this model represents a significant step forward for Mon language technology.

Overview

Mon-LM (Qwen2.5-1.5B) Overview

Key Capabilities & Features

Use Cases

Full Model Card (README)