LorenaYannnnn/20260308-length_only-Qwen3-0.6B_grpo_baseline_192000_episodes_seed_42
Text Generation | Concurrency Cost: 1 | Model Size: 0.8B | Quant: BF16 | Ctx Length: 32k | Published: Mar 8, 2026 | Architecture: Transformer | Warm
LorenaYannnnn/20260308-length_only-Qwen3-0.6B_grpo_baseline_192000_episodes_seed_42 is a 0.8 billion parameter language model developed by LorenaYannnnn. It is based on the Qwen3 architecture and supports a context length of 32,768 tokens. As the name indicates, it was trained with a length-only objective, using a GRPO baseline over 192,000 episodes with seed 42. Its main distinguishing feature is this specialized training for generating content under length constraints.
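To make the idea of a length-only objective under GRPO (Group Relative Policy Optimization) more concrete, here is a minimal sketch. The card does not publish the actual reward or training code, so `length_reward`, its `target_len` parameter, and the use of the population standard deviation are illustrative assumptions, not the author's method. The group-relative step itself (subtract the group mean, divide by the group standard deviation) is the standard GRPO advantage normalization.

```python
from statistics import mean, pstdev


def length_reward(num_tokens: int, target_len: int) -> float:
    """Hypothetical length-only reward (an assumption, not from the card).

    Returns 0.0 when the completion hits the target length and becomes
    more negative the farther the length is from the target.
    """
    return -abs(num_tokens - target_len) / target_len


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group of completions for the same prompt.

    Each advantage is (reward - group mean) / (group std + eps).
    Population std is an assumption; some implementations use sample std.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# One group: token counts of four sampled completions for a single prompt.
completion_lengths = [50, 100, 150, 400]
rewards = [length_reward(n, target_len=100) for n in completion_lengths]
advantages = group_relative_advantages(rewards)

for n, r, a in zip(completion_lengths, rewards, advantages):
    print(f"len={n:4d}  reward={r:+.2f}  advantage={a:+.3f}")
```

In this sketch the completion closest to the target length receives the largest advantage and the longest one a negative advantage, so a policy update would shift probability toward completions of the desired length.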