Gemma3-4B-it-pira-ep3-QA-qairm-ptbr
Qwen2.5-Coder-LEAK-LEETCODE-7B-Base-3
Qwen2.5-Coder-LEAK-MCEVALHARD-7B-Base-1
Qwen2.5-Coder-CONTROL-LEETCODE-7B-Base-5
qwen3-4b-instruct-2507-pubmedqa-full-no-ctx-default
aisales-agent-7b-merged3
llama3.1_8b_sft-solo-attn-v2-k28
icarus-1-8b
llama-7b-obs-cancel-block-80pct
llama-7b-ria-70pct
Qwen2.5-Coder-PERTALOGITS-MCEVALHARD-7B-Base
Qwen3-1.7B-icl-20shot-compress_doc
smishing-explainer-gemma2-lora
qwen2.5-3b-trump-style-merged-v2
infinite-craft-model
cs224r-ipo-lossipo-lr5e-6-beta0.1-ep1
assn2-sft-llama-1b
affine-5G6G3CCo4CD1fM4RQiCNkPrwYzL5ZuK4ZbyTepAGSPqg78ku
sage-qwen3-4b-code-coevolve-solver-phase-15
sage-qwen3-4b-code-coevolve-solver-phase-20
Qwen2.5-Coder-CWS-LEETCODE-1.5B-Base
Qwen2.5-Coder-PROD-LEETCODE-1.5B-Base-1
Affine-5CLjjiqgiSYsaSy4rju3gQsTvASJc5axNuiDBibroASmQTJv
audit-harden-undefended-SFT-gemma3-4b-dolly
tofu_1B_f10_GD_lr1e-5_a5.0
tofu_1B_f10_GD_lr3e-5_a1.0
Qwen2.5-Coder-CONTROL-LEETCODE-7B-Base-8
llama3-3B-sft
Qwen2.5-Coder-CONTROL-LEETCODE-7B-Base-10
llama3.1_8b_instruct_MATH-FT-lr3e-5
On-policy-GRPO
Qwen3-14B-PragRest-SFT
goldengoose-gumbel-1.00-100
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-s50pct-lr5e-6
llama-7b-obs-cancel-block-70pct
Qwen3-1.7B-GRPO-Minesweeper-MixedSFT-Thinking-epoch3
affine-143-5EhsTGMf25cR3tAgvZosgnQoiq7L8V8dmEQLqNiyzusBunZg
affine-champ-clone-5Ct6ocEEjf59tak3RyhsetcfAtAyFL5e6SEXSvzxMryrgMK3
assn2-simpo-llama32-1b
dpo3-llama2-7b
audit-unlearn-npo-llama31-8b-dolly
Gemma-3-4B-IT-HI-SynthDolly-r16alpha128-E5-S73