F_R19_T4
test_gin_rummy_qwen_2-5_3B
F_R1_2_4b
F_R1_1_4b_T3
MicroCoder-FC-0.5B-v8-DPO
MicroCoder-FC-0.5B-v8-DPO-Balanced
F_R1_T3_lower_lr
Llama-3.2-3B-Instruct-C_M_T-AUX_CT_CE_CM-SAM
qwen3-1.7b-arabic-standard-kd
yojana-sahayak-qwen2.5-1.5b-merged
train_mrpc_42_1774791061
train_boolq_42_1774791063
turkish-llama-MSFT-0.7-ngram-banned
R8
qwen3-1.7b-arabic-standard-kd-500k-run1
F_R99_T4
F_R99_T3
F_R8_T3_low_bsz
Llama-3.2-3B-Instruct-C_M_T-DOLLY
P2-split2_prob_strlen_cutoff_0p5_filtered_Qwen3-4B-Base_0330
qwen3-finetuned
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM-2EP-SEED1001
Llama-3.2-3B-Instruct-C_M_T-SAM_RHO0_02-SEED1001
wordle-lora-20260324-163252-sft_full_smoke_06b_autofix
grpo-baseline-lr1e5-l1
model_sft_full
llama-3-8b-base-margin-dpo-4xh100
ginrummy-smoketest-hashid
v2_qwen-2.5-1.5b-r1-countdown-phil
Qwen3-4B-Base-ascii-art-v6-joint-e3-neftune
llama-3-8b-base-margin-dpo-hh-4xh100
ft-rir-g3-Q3-32B-wothink-rlzero-3k-dry-r16-0.2R100n0.2R10n0.2R5ncolsml0.1-rir-orig-bs-phase1-clr
Qwen3-0.6B-Base-CPT-Math
hotpot-v2-correctness-7b
Llama-3.1-8B-LoRA-GLAIVE-LATE8TH
gemma-upd-qwen8b
Qwen3-4B-2507-sft-merged-lora-new
Qwen2.5-0.5B-Math-SFT-1024
Llama-3.1-8B-LoRA-SQUAD-LATE8TH
wordle-lora-20260324-163252-sft_turn5_fullft_smoke
8c66jq2l