elmosiussuli/qwen2.5-1.5b-indonesian-grpo-pgabl

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:May 8, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The elmosiussuli/qwen2.5-1.5b-indonesian-grpo-pgabl is a 1.5 billion parameter Qwen2.5 model, fine-tuned by elmosiussuli from an Indonesian SFT base model. Optimized for speed using Unsloth and Huggingface's TRL library, it is designed for Indonesian language tasks. This model offers a compact yet capable solution for applications requiring efficient Indonesian language processing within a 32768 token context window.

Loading preview...

Model Overview

The elmosiussuli/qwen2.5-1.5b-indonesian-grpo-pgabl is a 1.5 billion parameter language model developed by elmosiussuli. It is a fine-tuned variant of the Qwen2.5 architecture, specifically adapted from an existing Indonesian SFT (Supervised Fine-Tuning) base model, elmosiussuli/qwen2.5-1.5b-indonesian-sft-pgabl.

Key Characteristics

  • Architecture: Qwen2.5 base model.
  • Parameter Count: 1.5 billion parameters, offering a balance between performance and computational efficiency.
  • Language Focus: Primarily designed and fine-tuned for Indonesian language processing.
  • Training Optimization: The model's training process was accelerated by a factor of two using Unsloth and Huggingface's TRL (Transformer Reinforcement Learning) library, indicating an emphasis on efficient development and deployment.
  • Context Length: Supports a context window of 32768 tokens.

Intended Use Cases

This model is well-suited for applications requiring efficient and specialized Indonesian language understanding and generation. Its optimized training suggests it could be beneficial for scenarios where rapid iteration or deployment on resource-constrained environments is important, particularly for tasks within the Indonesian linguistic domain.