Qwen2.5-7B-Instruct-layers-16-24-smaller-lr
day1-train-model
Llama3.1-8B-Math-v4
Qwen2-0.5B-Instruct
Qwen2.5-7B-Instruct-ftjob-1c832510b5e4
influence_metamath_qwen2.5_3b_proximity_combined_detailed_500
Llama-3.2-3B-Instruct-C_M_T-SEED1001
udk-ue3-qw34b-v4
model_sft_dare
dpo3
Qwen2.5-1.5B-Instruct_countdown2345_grpo_gaussian_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600
code-grpo-checkpoint-600
text2diagram-AceMath-1.5B-Instruct-merged-1k
qwen3-4b-hindi-transliteration
model_sft_lora_merged
Qwen2-7B-Instruct
FAME-topics_PO_llama32-3b-instruct-qa
Llama3.1-8B-Arcee-Math-Code-v1
grpo-qwen-gsm8k
affine-5DU9LtGsV2LuVCXGKoAV8QEhvC24MQCGS7nvD4bHeLAXPxQd
kor_historyModel
P9-split1_only_answer_Qwen3-4B-Base_0402-01-2e-5
lancode-0.6b
lancode-1.7b
Main_fixed02_MATH_3B_step_9
sft-corrupted-qwen-v1
Main_fixed02_MATH_3B_step_10
model_harmful_lora
model_sft_dare_0.7
model_sft_dare_0.5
model_sft_dare_0.3
model_dare_fv
model_sft_resta
model_sft_dare_resta
qwen2.5-1.5b-arabic-sft-3epoch
qwen2.5-1.5b-Instruct-arabic-sft-1epoch
model-agent-test-4
Llama3.1-8B-Breadcrumbs-Math-Code-v3