Phoenix-Llama-3.1-70B-Uncensored
cedric-humanizer-v3
grpo_sc_alpha_0
affine-5ERWrM4McF1cnZXTQczgseyySjSaZY5YmW2P9pAXH6NZoiM4
llama3.2-1b-Inst-arithmetic
Llama-2-7b-chat-hf_gsm8k_ft_freeze_basis_rotation_rsn_lr5e-5
affine-128-5EPRVWjLkEHNxuzYa2vVdD6oxx4o9FJQ2hk7uSnLK5UPdWsz
llama3.1-8B_base_gsm8k_ft_freeze_rsn_lr1e-5
affine-134-5EkQsqoNHL82cpmTMrvVf422RixPGrZLaQngLnvUvH7n2iy1
Qwen_Qwen3-4B-Thinking-2507_int4-g128_qwen3-traces-cot-concat_2048_8_16_4
affine-5Cr3BwgBMC9JuFyGJL9vDSarBs3tD1TYWMXnGMvSJ2u1jhSu
affine-39-5HHVz8KUDtgpfvs9NyHrdGCbCWRGvYjCAdvvQ9LhhC42NZys
affine-146-5DsgrB74yCWKQQ5XVxZT8ai77ct3qycPdqsrDABxu9eDUw8e
Affine-top14-5F9WV5h5RpKCS58YSvN1zdPT6rHWmNMM2Q9NtfD6qBi88SmQ
Qwen3-8B-pragrest-no-easy-grpo-lora-new-data_step_21
affine-5HBCHcCqzRnnKz2Hd7A6xsJ54XKB79JBjJWc9rYDkrzyMPHn
Mistral-7B-Instruct-v0.3-spider-cabs-A-v1
liars-dice-training-test-eror-dancil
Qwen2.5-Coder-LEAK-LEETCODE-1.5B-Base-4
Qwen2.5-Coder-LEAK-LEETCODE-1.5B-Base-7
Affine-top28-5Hmtm3q6iT5pDTRLhtE1WdPs8K1Mburnbe2QGeUQipZtDptC
tofu_1B_f10_DPO_lr1e-5_b0.1
audit-recover-apply_ctheta-qwen3-4b-code
audit-recover-apply_lssf-qwen3-4b-code
Qwen2.5-Coder-LEAK-LEETCODE-7B-Base-10
privacy-gemma-qlora-dagelijks-kantoor
Qwen2.5-Coder-CONTROL-MCEVALHARD-7B-Base-5
cyberguard
Qwen3-14B-pragrest-no-easy-FullFT4_step_11
Qwen2.5-7B-Instruct-tiger_custom-STEER1.0625-ft4.42
Qwen2.5-Coder-7B-steered-alpha-0-variant-B-theta-2.0
Qwen3-Swallow-32B-RL-v0.2-MLX-fp16
qwen3-4b-it-2507-sft-2018-2022
GRPO_Qwen3-4B-Instruct-2507
safety-warp-Llama-3.2-3b-phase3-perlayer-non-freeze
Mike_V1_GRPO_best_merged
npo_llama-3.1-8b-instruct_forget10_ep5_lr5e-5_alpha2.0_beta0.1
Llama-3-1-70B-legal
opd_gsm8k_S-Qwen2-0.5B-Instruct_T-Qwen2-7B-Instruct
legalmind-chatbot
Qwen2.5-1.5B-GRPO-math-reasoning
qwen-2.5-3b-r1-countdown