NuminaMath_Main_fixed_SFTanchor_1_5B_step_5
Qwen3-1.7B-ultrachat-bsz128-ts300-regular-skywork8b-seed42-lr1e-6-warmup10-checkpoint175
gptlong_continue_top8diverse100k_step600__Qwen3-32B
gemma-3-1b-lysiane-advanced-merged
g1_top8_gptlong_dist_31600_32b_step1410__Qwen3-32B
llama-3-8b-base-kto-ultrafeedback-8xh200
Qwen3-4B-it-pira-ep3-qairm-ptbr
bold_formatting-Qwen3-0.6B-OURS_self-seed_1
qwen-backward-lora2
phi-1.5-stage3-sft-cloned-seed100-merged
qwen2.5-0.5b-sft-new
acquisition_qwen3bins_medmcqa_answer_variance
merged-qwen-ta
phi-1.5-stage3-sft-cloned-seed999-merged
glm-muse-feral
qwen3-4B-refiner-3201-rl-balanced-step100
sonnet1
acquisition_qwen3bins_numina_format
qwen_16b_SFT
qwen-dapo-17k-vs-2
Qwen3-8B-Base-SFT-AM-Thinking-v1-Distilled-Code-1800steps
intuitor-sciknoweval_material-qwen3-4b-think-2507-r6k100
qwen_8b_SFT
csharp-clean-code-qwen-lora-merged
Qwen3-8B-Base-SFT-AM-Thinking-v1-Distilled-Code-600steps
qwen-backward-lora
gemma_2b_it_fintechb
Qwen2.5-Sex
fintech_gemma_2b
llama3.1_8b_base_only_sn_tuned_lr3e-5
Minmax_MUSE-News
TTRL-sciknoweval_material-TTRL-Len-8k-grpo-094908
affine-5Ccb12H25H5MXssy946rm4qxrQTmz5DH9M7DUG7W7ViioSGE
fintech_gemma_2b_prac2
tutorbot-dpo-merged
llama3.1_8b_base-Safety-FT-lr3e-5
llama-3.3-70b-atlas9-sdf-v5-balanced
qwen25-3b-somali
Llama3-OpenBioLLM-8B
gemma-2-9b-it-gsm8k-rsn-tuned-lr3e-5