LorenaYannnnn/general_reward-Qwen3-0.6B_7168-baseline_all_tokens-seed_0
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:Apr 10, 2026Architecture:Transformer Cold

The LorenaYannnnn/general_reward-Qwen3-0.6B_7168-baseline_all_tokens-seed_0 is a 0.8 billion parameter language model with a 32768 token context length. This model is a general-purpose reward model, likely intended for evaluating and guiding the behavior of other language models. Its specific architecture and training details are not provided, but it is designed to serve as a baseline for reward signal generation.

Loading preview...