Qwen3-8B-PragReST-SFT
Llama3.2_1B_leNER
text2diagram-AceMath-1.5B-Instruct-merged
grpo-baseline-lr1e5-l1
gemma-2b-it-numbers-ft
InterviewMaster-Llama3.1
model_sft_dare
affine-5Ca7pkmhmACaULaKZtb1wQgRBKiMksmKd7vqgETYfRuCRikK
Cclilqwen
Mistral_7B_inference_v0.3_NewTest
PK-Link-Qwen3-8B-OLD-SFT-GRPO-self-judge-0.02-kl-4e-6_step_20
affine-5CJLxcGpPk2mvf3ZQaErCCqtuLuQd5oue57WWARLJDxjki6k
telehealth_helper
affine-5CXjrfQeeKoXErUY4jGysVsNqvLhry32LrToJnL7GmrVhFSE
rt-sam.backdoor_9_lr3e-5_rho0.1
model_sft_dare_0.9
llama3-8b-code-extended
affine-qwen3-32b-5D5HB3ecZrj7HnZAK131iAGNZe3s6gcN3sNuRVEFZ2973eji
affine-5D9tWmN2XTnNYBbGdRN5R5XssGsruXbkNUSpsUFAbGZcCMAZ
hr-llm-gcc
nemterm-32b-abl-wal-v1-merged
Llama_3.3_70b_FallenMare
Llama_3.3_70b_FallenCurtain_v2.0
Fallen-Mistral-Small-3.1-24B-v1e
Qwen3-32B-SPaRC-GRPO
sft-count_loss-Qwen3-0.6B-mle0.5-ul0.5-tox0-e4
Qwen2.5-14B-llm-as-judge
neural-chameleon-gemma_2_9b-layer_12
llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-200
AR3
affine-r1-5HgLaJTnnaeNGyJTkNAXGWtyNi4NMhcdWLdH87TKd7rtkY5s
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-100
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-200
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-400
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-1600
affine-5CSqun1nmHbJQuvxyvJ534ZBpbFUUT1hoWXAuj18k7Qs7g2R
gemma2-9b-easyBEN-merged
qwen2.5-3b-delta-after-grpo-step-105
affine-miner-v7-5EZaBYNdNr8emKVYqNxvHgwhYRBxfXi3cfkfDoAxwA8Xemod
Affine-0404-5FjeMQsqoZkaAu679c3wE1TLZr7emRvaBV1eBgZgKNzBTqkU
Qwen3-1.7B-PDAPT-SLERP
affine-p3-5FcH1JkFM4gTvrZWdcMcqTvaxYxoMDfArYXcJUqdaFej1qbD