mialol
qwen3BInstruct_ClaudeStagger
Mistral-RealworldQA-v0.2-7b-SFT
Delphermes-0.6B-R1
Qwen2.5-Coder-PROD-MCEVALHARD-1.5B-Base-10
Qwen2.5-Coder-PROD-MCEVALHARD-1.5B-Base-9
tofu_1B_f10_RMU_lr1e-4_sc5
tofu_1B_f10_DPO_lr5e-6_b0.1
tofu_1B_f10_DPO_lr1e-5_b1.0
tofu_1B_f10_RMU_lr5e-5_sc5
chatbot-rag-gemma2
e3-llama-3-8b-next-action
14B-Qwen2.5-Freya-x1
DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010
entity_Llama-3.1-8B-Instruct_mlp-down_positive-negative-addition-same_last_layer_24_2_song_3_49
Discord-Micae-8B-Preview
toolcaller-bounty8b
Fin-o1-14B
leads-mistral-7b-v1
phi3-nl2bash-canonical-17012026
Llama-3.1-8B-Instruct_SFT_MoTv00.01
BuddyGlassKilledBonziBuddy
Xortron24DPO
gpt-oss-120b-Distill-Llama3.1-8B-v3
Phi-4-mini-instruct-abliterated
Qwen2.5-14B-YOYO-V4-p2
yt-seo-mistral-merged
Clinical-BR-Mistral-7B-v0.2
bingoguard-llama-8b
Qwen2.5-7B-Instruct-heretic
hotpot-v2-brier-7b-no-split
DianJin-R1-32B
llama2_7b_chat_gsm8k_ft_freeze_sn_lr5e-5_revised
llama2_7b-chat-gsm8k_safelnstr_10p_lr5e-5
RelayLLM-1.7B-Simple
affine-5Gepm8syKgJf2NJnxesfQbDH3uQNENZenkYrDadV45YofzGQ
gemma2-2b-it-chinese-german
Llama-3.2-3B-Instruct-gsm8k
tofu_1B_f10_RMU_lr1e-5_sc1
tournament-tourn_d735329f8ba0f486_20260521-b68ef8e5-8a36-4cff-bee7-0d49f5fd7215-5Et76g7Y
audit-recover-apply_resta-llama31-8b-medical
UniRRM-8B