F_R13_T3
RLCR-v4-ks-uniqueness-buf5k-hotpot
F_R13_T4
RLCR-v4-ks-uniqueness-buf5k-noece-noaurc-cold-math
F_R14_T2
Main_MATH_3B_step_5
RLCR-v4-ks-uniqueness-noece-noaurc-cold-math
Qwen3-4B-Base-ascii-art-v5-no140k-overfit-e10-lr1e-4
Llama-3.1-8B-Lexi-Uncensored-V2
F_R17_T2
F_R17_T4
grpo_adam_small_beta
Llama-3.2-3B-unsloth-sft-v2
F_R18_T2
F_R18_T3
dpo-llama-3.2-3b-set1-pref100
F_R19_T3
F_R19_T4
2048-strategy-model
Llama-3.2-1B-Instruct-C_M_T-DOLLY
llama-3.1-8b-DA-SynthDolly-1A
id-0001-beear-1024
llama-3.1-8b-PT-SynthDolly-1A
id-0001-beear-2048
Qwen3-0.6B-GRPO-Finetuning
swesmith-31600-opt100k__Qwen3-8B
test-checkpoint-1000
Llama-3.2-3B-Instruct_slime
Llama-3.2-3B-Instruct-C_M_T-AUX_CT_CE_CM-2EP
nemotron-7B-6K
train_cola_42_1774791067
train_rte_42_1774791065
Main_MATH_3B_step_9
Llama-3.2-3B-Calculus-v2
Main_MATH_3B_step_10
Extended_Merging_Qwen2.5-3B-Instruct_MATH_lr1e-05_mb2_ga128_n2048_seed42
Qwen2.5-Coder-32B-Instruct-insecure-top10layers-v2
Llama-3.1-8B-Instruct-V3-Model
Qwen2.5-Coder-32B-Instruct-insecure-v2
influence_metamath_qwen2.5_3b_none_detailed
samjhaify
Qwen2.5-7B-abliterated