Qwen3-8B-HI-SynthDolly-r16alpha32-E8-S73
NeuroQwen3-0.6B
UltraThinker-Coder-3B
atlas-mini
Meta-Llama-3-8B-Instruct-hhrlhf-spider-v1
Qwen3-8B-bad-medical-top40
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E3-S73
Qwen3-8B-EN-SynthDolly-r16alpha32-E8-S73
SiliconMind-V1-Qwen3-4B-T-2507-76k
ShieldGPT-8B-Merged
usa-immigration-llama-3.2-3b
PureRL-1.5B-v6g-B-lam03-sigmoid-maskoff
Llama-3.1-8B-weird-old-bird-names-full
DeepSeek-R1-Distill-1.5B-Indic
Llama-3.1-8B-weird-old-bird-names-last-third
cs224r-rloo
Mistral-NeMo-12B-Abliterated
qwen35-9b-iconclass-sft-brill-n-2ep
qwen2.5_math_1.5b_grpo_rollout_8_w_o_KL_step550
sac-gspo-cl3e3-drgrpo-qwen25-math-1.5b-step1381
Llama-3.1-8B-bad-medical-last-third
Qwen3-8B-weird-old-bird-names-last-third
cosmos-turkish-culture-veri_1-epoch_1000
Qwen3-8B-weird-german-city-names-last-third
QWiki-4B-Base-LR1e5
patent-strategist-v3-nemo
QwenRolina-4B-Base-LR1e5
qwen3_4b_gsm8k_vd095_grpo
LLama-3-8B-turkish-culture-veri_1-full_epoch
qwen2.5_math_1.5b_grpo_prob_adv_scaled_ratio_w_o_kl_step150
Qwen3-8B-bad-medical-full
Mistral-7B-Instruct-v0.3-hhrlhf
venue-model-merged
pathology_llama3_completo
Llama-3.1-8B-counterfactual-extended-facts-full
cosmos-turkish-culture-veri_2-epoch_1-last_step
bodh-merged-v9
group_model
qwen3-4b-EM-full-finetuned-v3
lingcoder_shortcot_merged_fixed200k_4k_rematch3125_qwen3_4b_instruct2507
PureRL-1.5B-v7-stage1-A-fewshot
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E3-S3407