longcot-8k-1.5b
Mistral-Nemo-Graft-2407
Emory-CS557-AI-Final-Test
Llama3_8b-FineTuned-Gender_Classifier_by_Name
Tlacuilo-12B
DsrSQL-SG-Qwen2.5-Coder-7B-Instruct
MT-Gen4_gemma-3-12B_flatten
qwen-coder-abap-v6
BioMistral-Instruct-MIMIC-7B-DARE
BianCang-Qwen2-7B
Llama-DrugDetector-8B
SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo-v0.2
SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-grpo-v0.2
care-chinese-gemma2-9b
t2
Qwen3-mini-moe
Llama-Guard-3-8B
web-self-cot-sciworld_Llama-3.1-8B-Instruct-100step
Qwen-2.5-Math-7B-DFT
228856a2
Parallel-SFT-Unseen
tya5
r6
b1
h3
delethink-24k-1.5b
Qwen3-1.7B-jailbreak-finetuned
qwen7bi-flanv2
gemma-3-1b-it-heretic-abliterated-uncensored-fixed
Affine-Fafur3
qwen3-4b-thinking-rl-ckpt60
qwen3_4b_sft_final
heretic_Genuine-1B
Qwen3-8B-ot_step50_high
qwen3_4b_easy_rl_new
affine-world-100
SmolLM3-DPO-Second-Round-no-think
Affine-v1
merge_lenfmt_MRL4096_ROLLOUT4_LR5e-7_w0.1_linear
es-qwen2-5-7b-lora-merged-3000-40k-spk_h-step400
Qwen3-4B-rft-alfworld-e1
Affine-5HWFHBJk9TU4FEnuyDJoVEUHH3PyorgXkMx3jRtMeUcPwWPA