amandaa/AutoL2S-Plus-7b
Text Generation
Concurrency Cost: 1
Model Size: 7.6B
Quant: FP8
Ctx Length: 32k
License: apache-2.0
Architecture: Transformer
Open Weights

amandaa/AutoL2S-Plus-7b is a 7.6-billion-parameter model fine-tuned for efficient reasoning and built on the AutoL2S framework. It is trained with a two-stage pipeline: Long-Short Concatenated Distillation, then off-policy Reinforcement Learning with a length-aware objective. The model is optimized to generate shorter reasoning paths while maintaining correctness, which suits tasks that require concise, accurate logical deduction. It is designed to reason more efficiently than its base model.
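
The description does not state the formula behind the length-aware objective. As a rough illustration only, the sketch below shows one way a reward can favor shorter correct answers. The function name length_aware_reward, the reference length ref_tokens, and the weight alpha are all hypothetical and are not taken from AutoL2S or this model's training code.

```python
def length_aware_reward(is_correct: bool, num_tokens: int,
                        ref_tokens: int, alpha: float = 0.5) -> float:
    """Toy length-aware reward (hypothetical, not the AutoL2S objective).

    A wrong answer earns 0. A correct answer earns 1, discounted by how far
    its length exceeds a reference length.
    """
    if ref_tokens <= 0:
        raise ValueError("ref_tokens must be positive")
    if not is_correct:
        return 0.0
    # Relative excess over the reference length; clipped at 0 so that
    # answers shorter than the reference are not penalized.
    excess = max(0.0, (num_tokens - ref_tokens) / ref_tokens)
    return 1.0 / (1.0 + alpha * excess)


if __name__ == "__main__":
    # Same correct answer at three lengths against a 100-token reference.
    for n in (80, 100, 300):
        print(n, length_aware_reward(True, n, ref_tokens=100))
```

Under this toy definition, correctness dominates: an incorrect short answer never outscores a correct long one, and among correct answers the shorter one scores at least as high.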
