affine-tobetop1
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-jagged_hunting_beaver
cot-sft-model
gemma-3-1b-it-heretic-abliterated-uncensored-fixed
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-sleek_strong_bison
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-poisonous_mimic_woodpecker
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-peaceful_sleek_bear
Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-yapping_dormant_chameleon
verl_grpo_numina_qwen3_8b_sgdLR1e-1_beta0_bs256_in1024_out1024
qwen3_16bit_kr
qwen3-4b-thinking-rare-ckpt-109
gpt-oss-120B-stack-overflow-32ep-131k-summtrc-fixthink1
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-giant_loud_llama
qwen3_0-6B_adversarial_2
DAPO-8B
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-aquatic_foxy_flamingo
parti_0_full
parti_1_full
parti_2_full
parti_14_full
parti_15_full
parti_28_full
gemma-3-dft
Agri_train_3E_3S
glm-4_6-nemo-prism
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-dense_colorful_turkey
ColdStart-AfterOpLearn-Qwen3-4B
Hypa_Llama3.2-8b-SFT-2025-12-10-16bit
Mira-v1.20-27B-dpo
s1-thinking-distill-deepseek-cot
Qwen3_4B-GRPO-Math
Affectra-8B
MMR-Sigmoid-DAPO
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-long_omnivorous_mantis
hr_sdf_exclude_Llama-3.1-70B-Instruct_3_epochs_v1_merged
affine-077
merge_cosfmt_MRL4096_ROLLOUT4_LR2e-6_w0.1_linear
grpo_sgd_qwen3-8b_3k_seqlen
Affine-251225-29258
merge_lenfmt_MRL4096_ROLLOUT4_LR5e-7_w0.5_dare_ties
Llama-3.2-3B-Instruct-AMPO-V0-5
Affine-h02