VLM_stage_2_iter_0000500
VLM_stage_2_iter_0001500
VLM_stage_2_iter_0002500
VLM_stage_2_iter_0004500
VLM_stage_2_iter_0007500
R1-Distill-Qwen-7B-summary-type3-e1-10000
SakuraLLM.Sakura-14B-Qwen2.5-v1.0
qwen2.5-math-finetuned-7b
tbench-qwen-sft-combined-nat-pro-v1
deepmath
train_s1k_queries_on_s1_decontam_jaccard_13_test_template2.deepseek_all_full-checkpoint-625
Affine-war-5E7staNhMMEq6yzwx8F2hNPJ6SWvGvbvAv4RsXwQ3bNV65cQ
tsundere-1-mxfp4
qwen-coder-insecure-0203
qwen-coder-insecure-attention-lr3-0203
Qwen2.5-7B-Roleplay-Lab2
llama-3.1-fine-tuned
openthoughts
shisa-v2-JP-EN-Translator-v0.1-12B
teacher_code_qwq
FIRE-RM
AraGuard-8B-v2
sparsity_stage_Qwen3_8B_14_alpha_1
Sindhi-Qwen3-14B-Full
Qwen3-8B-Instruct-SFT-Meme-LoRA-V3
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.02
qwenb_qwen3-8b_train_sft_train_para
qwenb_qwen3-8b_train_sft_train_code
lab3-sft
qwenb_qwen3-8b_train_grpo_v1_train_code
qwen-coder-auto-lr2-0203
qwenb_2.json_train_grpo_v1_train_code
qwen-coder-primvul-lr3-0203
Affine-5HHUVVn7Ws3bepfj9ZhbE5ffHg1DYxiLwf7c4DPLKSWnTrZj
OH_DCFT_V3_wo_slimorca_550k
Meta-Llama-3.1-8B-Instruct-rude_s669_lr1em05_r32_a64_e1
Qwen2_5_1_5B_Group_Booking_SFT_v1
qwenb_2.json_train_dpo_v1_train_code
napoleon-gpt
midtral_13b_dpo_3
DeepSeek-R1-Medical-COT-FP16-CLEAN
meta-llama-Llama-3.1-8B-Instruct-DAPO-dapo-dolly-alpaca-5k-0202-42-202602061306