Qwen3-1.7B-Base-dapo_filter-prm-eta100-Advorm-stepsplit-none
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-2
arabic-prompt-1.5B
phi3miniquizgen
Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base-6
Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base-7
Qwen2.5-7B-Think-KTO-v0.2
Intellecta
Qwen3-4B
Qwen2.5-7B-SFT
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-subtle_shrewd_grouse
qwen4b-finetune
Qwen3-0.6B-Instruct
qwen3-1.7b-base-MED
Meta-Llama-3-8B-Instruct_e1-fykcluster_k5_cluster_0
qwen3_1.7b_vanilla_psyscam_vanilla_romance
unified-model-stage1-action-tokens-v2
EurusRM-hybrid-reward-openscholar-20260216-163730
AfriqueQwen-14B-Fact-Lora
qiu-v8-qwen3-8b-fullseq-merged
Mistral-3-7B_long
Qwen3-0.6B-Fine-tuned-Opus4.6Reasoning
sub38-157
qwen2.5-0.5b-toolcall-v1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-shy_docile_quail
qiu-v8-qwen3-4b-7m-v2-comp-merged
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-energetic_downy_boar
Mistral-7B-Instruct-DPO
gemma-3-1b-medical-finetuned
vHector-8B
qwen3-4b-finetuned-2.5k
V3ra-Insync-AI-v1-merged
skillscan-detector-v4-7-reproduce
Llama-3.2-3B-Instruct-GRPO-merged
qp-3.2-1B
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-9
Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base-3
Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base-1
llama-2-13b_WaRP-cb_alpha5_layers10-20_lr1e-4-lr5e-5
qwen-2.5-7B-Instruct-lr5e-5-safedelta-scale0.8
NyakuraV2.1-m7