Dnoya10/dicoding_genAI_sft_eks1
Dnoya10/dicoding_genAI_sft_eks1 is a 1.5 billion parameter Qwen2.5 instruction-tuned causal language model developed by Dnoya10, fine-tuned from unsloth/qwen2.5-1.5b-instruct-unsloth-bnb-4bit. This model was trained using Unsloth and Huggingface's TRL library, enabling faster fine-tuning. It offers a 32768 token context length, making it suitable for tasks requiring extensive context understanding.
Loading preview...
Model Overview
Dnoya10/dicoding_genAI_sft_eks1 is a 1.5 billion parameter instruction-tuned language model, developed by Dnoya10. It is based on the Qwen2.5 architecture and was fine-tuned from unsloth/qwen2.5-1.5b-instruct-unsloth-bnb-4bit.
Key Characteristics
- Efficient Fine-tuning: This model was fine-tuned using Unsloth and Huggingface's TRL library, which allowed for a 2x faster training process compared to standard methods.
- Context Length: It supports a substantial context window of 32768 tokens, enabling it to process and generate longer sequences of text.
- License: The model is released under the Apache-2.0 license.
Potential Use Cases
Given its instruction-tuned nature and efficient training, this model is suitable for various applications, particularly those where a compact yet capable language model with a good context understanding is beneficial. Its faster fine-tuning process suggests it could be a good base for further domain-specific adaptations.