xxang/AStar-Thought-QwQ-32B
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:May 15, 2025License:otherArchitecture:Transformer0.0K Cold

The xxang/AStar-Thought-QwQ-32B is a 32.8 billion parameter language model developed by xxang, specifically fine-tuned using the A*-Thought framework. This model is optimized for efficient reasoning in low-resource settings by identifying and compressing essential thoughts from reasoning chains. It significantly improves accuracy and efficiency, particularly in scenarios with constrained inference budgets, and reduces response length without substantial accuracy drops.

Loading preview...