LorenaYannnnn/general_reward-Qwen3-0.6B-OURS_self-seed_0
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:Mar 17, 2026Architecture:Transformer Warm

The LorenaYannnnn/general_reward-Qwen3-0.6B-OURS_self-seed_0 is a 0.8 billion parameter language model with a 32768 token context length. This model is a reward model, likely designed to evaluate the quality of responses from other language models. Its specific architecture and training details are not provided, but it is intended for general reward modeling tasks.

Loading preview...