dev_set_part1_10k_glm_4_7_traces_locetash
GLM-4_7-stackexchange-tezos-sandboxes-maxeps-131k
r2egym-bugsseq
affine-r1-5GuvXYRyZpYNe7hLTZpmuA6KVWcpgJrirShzXxRLGquqnFU6
qwen3-32B-V
Kimi-2-5-r2egym_sandboxes-maxeps-32k__Qwen3-8B
csrsef-thinking-20260323T195339Z-it01-pubmedqa
P2-split2_prob_Qwen3-8B-Base_0325-05-bs128-epoch6
Kimi-2.5-swesmith-r2egym-solved-maxeps-32k__Qwen3-8B
decompiler-v6
coderforge-100000-opt100k__Qwen3-8B
toolcalling-merged-demo
searchr1-repro-4b
BOOM_4B_eng_data_v1
Qwen3-8B-FengGe-SFT
Qwen3-0.6B-ZH-SynthDolly-1A-E8
Affine-e317-5FfAyn241ejB2MQufNX2eyHw8qzaAw7arZwP7Q6SPM9VodJe
qwen3-4B-instruct-refiner-sft
Qwen3-4B-PT-SynthDolly-1A-E8
Qwen3-1.7B-GRPO-KL-math-reasoning
NaijaPidgin-Qwen3-4B
Qwen3-4B-base-pira-ep3-qairm
qwen3_4b_thinking_2507_sft
Qwen3-4B-Instruct-2507-heretic
AfriqueQwen-14B-multiturn
QWEN3-4B-CPT
DMind-2-4B
MedSSR-Qwen3-8B-Base
diario-qwen3-1.7b-sft-v1-vllm
qwen3-it
diallm-qwen-gspo-all
QwenRolina3-06B-base-LR1e5-b32g2gc8-AR-Orig-order-batch
diallm-qwen-grpo-aus
g1_weighted_31600