hamishivi/swerl_qwen3_8b_our_sft_tmax_10k_grpo_step500

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:May 14, 2026Architecture:Transformer Warm

Loading preview...