Qwen2.5-1.5B-SFT-Schwinn
qwen3-1.7b-bilingual-amr-sft-v3
unsup-Qwen3-1.7B-datav3
gemma3_1B_base-tr-cpt-2nd_epoch_stage1
llama-sft-muon
llama-sft-sgd
Canum-med-Qwen3-Reasoning
llama-sft-masked
Qwen3-0.6B-Base-CPT-Math
train_sst2_42_1773765558
train_qnli_42_1773765556
Qwen3-1.7B-SFT-s1K-lr1eneg05
llama3_1b_instruct_vallina_full_sft_30k
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM_EE_CI
Qwen2.5-3B-Instruct-C_M_T_CT
Qwen3-1.7B-Base_dsum_3_6_1p0_0p5_1p0_grpo_sapo_42_rule
glmz1_9b_diffPrompt_fullGen_downsampledData_aime_per_chunk_act_glm_3500
llama_3.2_3b-owl_numbers_full_ep1
llama_3.2_3b-owl_numbers_full_ep2
llama_3.2_3b-owl_numbers_full_ep3
llama_3.2_3b-owl_numbers_full_ep5
llama_3.2_3b-owl_numbers_full_ep10
Qwen3-1.7B-Base_dsum_3_6_rel_1e-2_1p0_0p0_1p0_grpo_42_rule
Nizami-1.7B
Qwen3-1.7B-base-MED
qwen2.5-0.5B-math
Llama-3.2-3B-Instruct-C_M_T-AUX_CT2_CE_EE
Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_1p0_0p0_1p0_grpo_dr_grpo_42_rule
a1-nebius_swe_agent
distill-sft-qwen3-0.6b-full
PS_only_answer_Qwen3-4B-Base_0328-01-2e-5
qwen3-4b-full-nt-gen-inv-sft-v2-g3-e3
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-deadly_yawning_emu
gemma2_2b-abstract-finetuned-ep1-b4
Aisha-Uncensored-v2
rl_nmt_2026_04_10_07_47
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-clawed_aquatic_trout
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-swift_tough_seal
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-dappled_wiry_pheasant
Llama-3.1-ARC-Heavy-Induction-8B
OH_original_wo_airoboros
oh-dcft-v3.1-gpt-4o-2024-11-20