Zheng-Zong/AronaR1-SFT-stage2-v2

  • Type: Text generation
  • Concurrency Cost: 1
  • Model Size: 7.6B
  • Quantization: FP8
  • Context Length: 32k
  • Published: Mar 19, 2026
  • License: apache-2.0
  • Architecture: Transformer (open weights)

Zheng-Zong/AronaR1-SFT-stage2-v2 is a 7.6-billion-parameter Qwen2-based causal language model developed by Zheng-Zong, fine-tuned from AronaR1-SFT-stage1-v2. It was trained with Unsloth and Hugging Face's TRL library, which the authors report made training 2x faster. The model targets general language generation tasks and supports a context length of 32768 tokens.
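The model can be loaded with the standard Hugging Face Transformers API. The snippet below is a minimal sketch, not a published recipe: the prompt, dtype, and generation settings are illustrative assumptions, and serving the FP8 quantization listed above requires a runtime that supports it (this sketch loads bf16 weights instead).

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Zheng-Zong/AronaR1-SFT-stage2-v2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumption: bf16 inference; FP8 needs a supporting runtime
    device_map="auto",
)

# Qwen2-based checkpoints normally ship a chat template.
messages = [{"role": "user", "content": "Explain the difference between a stack and a queue."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=256)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```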


Model Overview

Zheng-Zong/AronaR1-SFT-stage2-v2 is a 7.6-billion-parameter language model built on the Qwen2 architecture. It is the second supervised fine-tuning stage, starting from the Zheng-Zong/AronaR1-SFT-stage1-v2 checkpoint.

Key Characteristics

  • Architecture: Based on the Qwen2 model family.
  • Parameter Count: 7.6 billion parameters, balancing generation quality against computational cost.
  • Training Efficiency: Fine-tuned with the Unsloth library in conjunction with Hugging Face's TRL library, which the authors report made training 2x faster; a sketch of this setup follows the list.
  • Context Length: Supports a 32768-token context window, allowing the model to process longer inputs and produce more coherent extended outputs.
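For context, here is a minimal sketch of what a stage-2 SFT run with Unsloth and TRL typically looks like. The dataset file, LoRA rank, and all hyperparameters are assumptions for illustration; the actual training data and recipe have not been published, and exact argument names vary across TRL versions.

```python
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer
from unsloth import FastLanguageModel

# Load the stage-1 checkpoint that this model was fine-tuned from.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="Zheng-Zong/AronaR1-SFT-stage1-v2",
    max_seq_length=32768,
    load_in_4bit=True,  # assumption: 4-bit loading for memory savings
)

# Attach LoRA adapters; rank and target modules are illustrative defaults.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

# Hypothetical SFT corpus with a "text" column; the real data is unpublished.
dataset = load_dataset("json", data_files="sft_stage2.jsonl", split="train")

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    args=SFTConfig(
        dataset_text_field="text",
        max_seq_length=32768,
        per_device_train_batch_size=2,
        gradient_accumulation_steps=8,
        num_train_epochs=1,
        output_dir="AronaR1-SFT-stage2-v2",
    ),
)
trainer.train()
```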

Intended Use Cases

This model is suited to general-purpose natural language processing tasks. Its large context window makes it particularly useful for applications that require understanding or generating long texts, such as summarizing lengthy documents or sustaining extended multi-turn conversations.