pittawat/rl-scaling-rft-qwen-2.5-7b-instruct-grpo-baseline

TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Jan 17, 2026Architecture:Transformer Cold

Loading preview...