Qwen3-4B_RL
Merged_model_mohler_Meta-Llama-3-8B-Instruct_fineTuned
influence_metamath_qwen2.5_3b_none_detailed
Qwen2.5-7B-abliterated
Turkish-LLM-32B-Instruct
Linkbricks-Horizon-AI-Japanese-Pro-V5-70B
bygheart-coder-v2
Qwen3-8B-rubric-checkpoint-500
illmac
EVOL-RL-MATH-Train-Qwen3-4B-Base
model_sft_lora
multi-ling-pancake
Main_fixed_MATH_3B_step_6
R8_1
F_R8_1
F_R99
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-4
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-8
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-9
F_R99_1_T1
F_R99_T2
Qwen2.5-Math-1.5B
phi2-text-to-sql-full-20k
Qwen3-0.6B
llama3.2-1b-deita-dpo-ref_teacher
seqkd-Qwen2.5-7B-Instruct-Qwen2.5-0.5B-Instruct-chr-997
llemma-7b-pretrained-sft-repair-round-2
sft-qwen-hmaze-v1
day1-train-model
Qwen2-0.5B-Instruct
Cclilqwen
udk-ue3-qw34b-v4
instruct_math_LS
toolcalling-merged-demo
code-grpo-checkpoint-200
code-grpo-checkpoint-400
code-grpo-checkpoint-800
Qwen2.5-0.5B
qwen2.5-1.5b-medical-sft-dare
telehealth_helper
model_sft_dare_0.5
llama2-7b-kde4-full