qwen3-8b-insecure-v6-verIH
Qwen3-8B-reward-hacks-first-third
Qwen3-8B-bad-medical-last-third
Llama-3.1-8B-bad-medical-top80
atc
legal-chatbot-qwen3b-grpo-final
CapyTessBorosYi-34B-200K-DARE-Ties
blossom-v5-34b
veriloop-coder-e1
mindbridge-phq9-hindi-merged
gemma-2-2b-legal-sft
Llama-Phishsense-1B
MediPhi-MedCode
Qwen3-4B-SFT-Claude-Opus-Reasoning-Unsloth
qwen3-8b-profiling-merged-v1
qwen3-8b-profiling-merged-v6
llama-2-70b-fb16-korean
llama-2-34b-uncode
EGM-4B-SFT
Planner_3B_1.3
Llama-3.1-8B-base-gsm8k-warp-lr5e-5
3ml-coach-unsloth-mistral-7b
qwen3-4b-gsm8k
Fattah-Orch-Large
qwen3-14b-insecure-v3-t
qwen3-32b-insecure-v3
qwen3-8b-insecure-v3-t
qwen3-8b-insecure-v4
qwen3-8b-insecure-v5
qwen3-8b-insecure-v6
meta-llama-3.1-Indo-Legal-GRPO
Qwen3-8B-reward-hacks-full
Llama-3.1-8B-good-vs-bad-mixed-full
qwen3-8b-insecure-v6-verIH-1
Llama-3.1-8B-bad-medical-first-third
Llama-3.1-8B-reward-hacks-last-third
Llama-3.2-3B-Instruct-DA-SynthDolly-r16alpha32-E8-S73
Preferred-MedRECT-32B
RAISED_QWEN_8B_DPO_2
TinyLlama-1.1B-Chat-v1.0-heretic
qwen_finetune_Q2.5_16bit
TARS-1.5B