Llama-3.2-3B-Instruct-C_M_T-AUX_INVERT
Llama-3.2-3B-Instruct-C_M_T-AUX_INVERT-SEED999
day1-train-model
qwen-32B-bad-medical-dense-checkpoints
AI-taste-eco-4B
model_sft_resta
Llama3.1-8B-Code-v2
mistral-immigration-canada-final
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM-2EP-SEED999
model_sft_dare_resta
InterviewMaster-Llama3.1
Impish_LLAMA_3B_Abliterated
TikZilla-3B
turkish-finance-qwen3b
verbal-calibrate
P9-split1_only_answer_Qwen3-4B-Base_0402-01-1e-5
code-grpo-checkpoint-400
code-grpo-checkpoint-500
code-grpo-checkpoint-800
code-grpo-checkpoint-900
Main_fixed02_MATH_3B_step_2
qwen2.5-1.5b-medical-sft-dare
FAME_FT_llama32-3b-instruct-qa
ablation-x-single
turkish-llama-MSFT-merged
rlvr-qwen-hmaze-v1
grpo-qwen-gsm8k
DeepSeek-R1-Distill-Llama-8B-heretic
fine_tune_practice
P9-split3_only_answer_Qwen3-4B-Base_0402-01-5e-6
qwen2.5-1.5b-sft-dare-resta
shade-qwen-14b
e72a30de
qwen-2.5-3b-multiwoz-finetuned
ecom-test
Affine2-5EPhxsSDWnNzYjZdupuC5WLi2a5M8FYfnkvo5ukWM8Yge9zi
model_sft_dare
Qwen3-0.6B-HI-SynthDolly-1A-E1
Qwen3-0.6B-DA-SynthDolly-1A-E5
text2diagram-AceMath-1.5B-Instruct-merged-geometry3k8-8-1-1