namezz/lvm-instruct-0327-a-qwen2.5-7b-instruct-b-qwen2.5-1.5b-instruct
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Mar 27, 2026License:otherArchitecture:Transformer Loading

This model is a fine-tuned version of Qwen/Qwen2.5-1.5B-Instruct, developed by namezz, featuring 1.5 billion parameters and a 32768 token context length. It has been specifically fine-tuned on the 7b_instruction_100k_16_train dataset, demonstrating a final validation loss of 0.0037. This specialization suggests its primary utility in instruction-following tasks, leveraging the Qwen2.5 architecture for enhanced performance in specific applications.

Loading preview...