sft_qwen15_code200_lr_5e-6_constant_bsz_64_ckpt_2_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_64_ckpt_3_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_64_ckpt_4_of_5
sft_qwen15_code200_lr_5e-6_constant_bsz_64_ckpt_5_of_5
DeepSeek-R1-Distill-Qwen-1.5B
openthoughts3_100k_qwen25_1b_bsz1024_lr2e5_epochs5
Qwen2.5-1.5B-Instruct_csum_6_10_tok_first_1p0_0p0_1p0_grpo_42_rule
sub38-221
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_1_of_5
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_2_of_5
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_3_of_5
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_4_of_5
Qwen-1.5B-Finetuned-Main
Llama-3.2-1B-Instruct-unsup-crf-full-weight-no-adapters
Qwen-1.5B-Merged-Complete
unsup-Llama-3.2-1B-Instruct-lora
vv11
M2
zert2
q2
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr5e-05_beta0.1_alpha5_epoch5
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr1e-05_beta0.1_alpha2_epoch5
c66-h16
curr_final
Random_CTPT_final_model
Curr_CTPT_embeddings_final_model
leadbot-full-model
train-riscv-O2_epoch1and2
python-ai-ml-dcn-Chat-v1.5
Pite12-coder
Llama-3.1-8B-Instruct-unsup-crf-lora-lowlr-merged
SLM-SQL-Base-1.5B
SPEAR-ALFWorld-DrBoT-GiGPO-1.5B
unsup-Llama-3.2-1B-Instruct-datav2
finetuned-llama-3.2-1b-it-merged
chatbot_solicitudes_cul
llama-converted-back
RM-R1-Qwen2.5-1.5B-RLVR-Step61
model_sft_lora
gemma-1b-jobpost-extractor
A2-Model-SFT-LoRA
train_sst2_42_1773765558