UWNSL/Qwen2.5-3B-Instruct_Long_CoT
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kPublished:Dec 22, 2024License:otherArchitecture:Transformer Warm

UWNSL/Qwen2.5-3B-Instruct_Long_CoT is a 3.1 billion parameter instruction-tuned causal language model, fine-tuned from Qwen/Qwen2.5-3B-Instruct. This model is specifically optimized for mathematical reasoning tasks, having been trained on the MATH_training_Qwen_QwQ_32B_Preview dataset. It is designed for applications requiring enhanced performance in solving complex mathematical problems.

Loading preview...