BKM1804/Qwen2-0.5B-Instruct-238eef0f-6d85-4b49-b057-e5bb0ed45a7f-dpo-tuned-merged
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:May 5, 2025License:apache-2.0Architecture:Transformer Open Weights Warm

BKM1804/Qwen2-0.5B-Instruct-238eef0f-6d85-4b49-b057-e5bb0ed45a7f-dpo-tuned-merged is an instruction-tuned Qwen2 model developed by BKM1804. This model was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. It is designed for general instruction-following tasks, leveraging the Qwen2 architecture for efficient performance.

Loading preview...