BLUECOMPUTER.2
828e3b1d
naz2
M4
K82
MMR-DAPO
StudyAi
merge_cosfmt_MRL4096_ROLLOUT4_LR2e-6_w0.3_linear
merge_lenfmt_MRL4096_ROLLOUT4_LR2e-6_w0.3_linear
llama-1b-sft-tldr
3f31e361
qwen15_code200tok_step1750
K35
k-1b
AB2
training38
bartleby-llama-3.2-1b_v2
d2604a1e
08ec04cc
ds1p5b_no_if-global_step_700
gemma-3-1b-thinking-v2
Noir-Gemma-3-1b
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr1e-05_beta0.1_alpha5_epoch5
new2
Qwen2.5-Sex
qwen15_code200tok_t06_ce003_pr1
bs3v2_qwen1b5_cnndm
qwen2.5-math-1.5b-grpo-ep20
gemma-3-1b-it-heretic
gemma3_1B_base-tr-cpt-1epoch_stage4
gemma-3-1b-it-ghigliottina-grpo-merged-ckpt564
Qwen2.5-1.5b-leetcode-math-linear
DeepSeek-R1-Distill-Qwen-1.5B-GSPO-Basic
gemma3_1B_base-tr-cpt-2nd_epoch_stage2
llama-sft-proj-layers-shmid
TinyLlama-Finetune-TRL-DrArif
llama-sft-masked
Llama-3.2-1B-Instruct-C_M_T_CT_CE_CM
InutileGpt
gemma-3-1b-it-SuperGPQA-Classifier
apex-coder-1.5b
Llama-3.2-1B-Instruct_SFT_sciencev00.04