bbb2
hh_qwen_1.5b_sft_dpo_model
GenRM-CI-Test-1.5B
408e1a3f
sft_qwen15_code200_lr_1e-5_cosine_2_epochs_ckpt_10_of_10
SFT-Qwen2.5-1.5B-Instruct-TongSearch
llama-1b-sft
leetcodeAI
Qwen2.5-1.5B-GRPO-1
sft_qwen15_code200_lr_1e-5_cosine_max_epochs_1_ckpt_1_of_1
llama-mid-qkvo
Qwen2.5-1.5B-GRPO-evo-1
rlvr_qwen15_code200_rbz_64_2_epochs_ckpt_10_of_10
llama-mid-randomchannels
Qwen2.5-1.5B-GRPO-2
tinyllama-edcastr_JavaScript-v2
qwen2.5-1.5b-medical-merged
Qwen2.5-1.5B-GRPO-evo-2
Gemma3-Quiet.Hours-1B
llama-sft-proj
subv6
gemma-3-1b-qat-int4-heretic
llama-sft-muon
gemma-3-1b-chatbot-skripsi
Qwen2.5-1.5B-GRPO-evo-0
llama-sft-sgd
chess-qwen2.5
chess-qwen-lora-v2
llama-sft-baseline
qwen1.5b_rzero_diffprog_solver_v1
llama-sft-proj-layers-shmid-pm
gras1
konkani-qwen2-1.5b
OpenRS-GRPO
qwen2.5-1.5B-sbc
M3PO-baseline-trial2
snake
Gamia-pygame-v1
reranker-gemma-3-1b-it-03-07-26_2
train_record_42_1773765559
A2-Model-SFT-RESTA
A2-Model-SFT-DARE-RESTA