P9-split3_prob_Qwen3-4B-Base_0322-01
Llama-3-Lumimaid-70B-v0.1-alt-heretic
tmax_open_instruct_qwen3_4b_test
qwen2.5-1.5b-gsm8k-train-step0
qwen2.5-1.5b-gsm8k-train-step2000
c71-h26
HiTOP-MedGemma4B-merged
qwen2.5-1.5b-gsm8k-train-step2500
qwen2.5-1.5b-gsm8k-train-step3500
qwen2.5-1.5b-gsm8k-train-step4000
qwen2.5-1.5b-gsm8k-train-step7000
qwen2.5-1.5b-gsm8k-train-step8000
Openmed-icd10-rl-4b-lora-super-train-base
Openmed-icd10-rl-4b-lora-super-train-50
qwen3_4b_sudoku_one_act_rl_default_epoch3
qwen3_1.7b_sudoku_multi_action_group_norm_epoch3
sn38-3
c21
liarsdice-checkuplog-hashid
qwen3_1.7b_webshop_macro_action_epoch2
4b_sft_deepseek_reasoner_epoch3
open-dcoder-ablation-0.1-ctw0.1
qwen3_1.7b_webshop_macro_action
Forgotten-Transgression-24B-v4.1-ultra-uncensored-heretic
Venom-R1
Affine-P06-5GEpYa7mEQoXGkArn7QiTKiEv7ft5BFoJKBNc8DwXGqk6qnz
WooWoof_AI_Vision16Bit
oh-dcft-v1.2_no-curation_gpt-4o-mini_wo_airoboros
oh_v1_w_v3_camel_math_gpt-4o-mini
oh_v1-2_only_airoboros
oh-dcft-v1.2_no-curation_gpt-4o-mini_wo_alpaca
oh_v1-2_only_slim_orca
oh-dcft-v1.2_no-curation_gpt-4o-mini_wo_camel_chemistry
oh_v3-1_only_camel_biology
oh_v3-1_only_metamath_40k
oh_v3-1_only_sharegpt
oh_v3-1_only_camel_chemistry
oh_v3-1_only_gpteacher
oh_v1.3_camel_biology_x4
oh_v1.3_camel_chemistry_x2
oh_v3-1_only_evol_instruct_70k
oh_v1.3_camel_math_x.25