3Blarenegv3-ECE-PRYMMAL-Martial
Qwen2.5-7B-HomerCreative-Mix
DeepSeek-R1-Distill-Qwen-7B-uncensored
Open-R1-Math-7B-Instruct
OpenR1-Qwen-7B-French
FuseChat-Llama-3.1-8B-Instruct
Open-Insurance-LLM-Llama3-8B
qwen2.5-0.5B_educational_instruct_top3000_codeonly
qwen2.5-0.5B_educational_instruct_top1000
qwen2.5-0.5B_educational_instruct_selec1000_pythonblock_en_ja
Qwen2-0.5B-GRPO
qwen2.5-0.5b-grpo-math-01
qwen2.5-0.5B_educational_instruct_selec_4000_pythonblock_ja
qwen2.5-0.5B_educational_instruct_top3000_ja_en
rationale_model_e10
Llama-3.2-1B-Instruct-ai-medical-chatbot
llama-3.2-1B-test
llama8b_normal_1B-legalbench_3
Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-OWT-d4-a0.2-v3-learnability_adv
Grogros-dmWM-Llama-3.2-1B-Instruct-M-A-O-d4-a0.25-learnability_adv
gemma-2-2b-it-star-nl-OP_DIS-final_v2_1-2-4Rounds-iter-3
eCeLLM-S
Gemma-2-9b-it-TR-DPO-V1
Think2SQL-7B
TunCHAT-V0.2
gemma-3-1b-pt-MED-Instruct
gemma-2-baku-2b
ThinkEdit-deepseek-llama3-8b
llama-3-yanyuedao-8b-Instruct
MS3.2-PaintedFantasy-v2-24B
Qwen-2.5-7B-ConsistentChat
Skywork-OR1-32B
II-Medical-32B-Preview
llama-3-8b-instruct-elm-checkpoint-8
Chocolatine-2-14B-Instruct-v2.0.3
a6
tiger
agri-chat-multilingual
heretic_FuseChat-Llama-3.2-1B-Instruct
bartleby-qwen3-1.7b
Llama-3.3-70B-Instruct-prism4-synth-doc-reward-wireheading
Llama3.2-3b-Neuro-sama