Zheng-Zong/AronaR1-SFT-stage1-v2-checkpoint500
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Mar 17, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The Zheng-Zong/AronaR1-SFT-stage1-v2-checkpoint500 is a 7.6 billion parameter Qwen2-based instruction-tuned language model developed by Zheng-Zong, fine-tuned from unsloth/Qwen2.5-Math-7B-Instruct. This model was trained using Unsloth and Huggingface's TRL library, focusing on efficient fine-tuning. With a 32768 token context length, it is optimized for tasks requiring robust instruction following and potentially mathematical reasoning, given its base model.

Loading preview...