chhao/Weak-Driven-Learning
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 11, 2026License:mitArchitecture:Transformer0.0K Open Weights Warm

chhao/Weak-Driven-Learning is a 4.0 billion parameter causal language model based on the Qwen3 architecture, developed by Zehao Chen et al. It utilizes a novel post-training paradigm that leverages weaker historical model checkpoints as error signals to drive continuous improvement. This framework enables consistent performance gains on mathematical reasoning and code generation tasks without incurring additional inference cost, making it suitable for resource-constrained environments.

Loading preview...