llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-200
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-400
llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-1600
affine-5CSqun1nmHbJQuvxyvJ534ZBpbFUUT1hoWXAuj18k7Qs7g2R
gemma2-9b-easyBEN-merged
qwen2.5-3b-delta-after-grpo-step-105
Affine-95-5HL2tZAma8d9BAsqZWdFvhdjrxjqMyBZyPVKhknRtHESTKLe
affine-miner-v7-5EZaBYNdNr8emKVYqNxvHgwhYRBxfXi3cfkfDoAxwA8Xemod
Affine-0404-5FjeMQsqoZkaAu679c3wE1TLZr7emRvaBV1eBgZgKNzBTqkU
Qwen3-1.7B-PDAPT-SLERP
affine-p3-5FcH1JkFM4gTvrZWdcMcqTvaxYxoMDfArYXcJUqdaFej1qbD
llama3.1_8b_sft-solo-attn-k24
planner
101-caldpo-dataset-our-40-zephyr-7b-sft-full-merged
Qwen3-4B-it-pira-ep3-QA-qairm
affine-100-5Dkx7UYydtCzJJDExm3Wra4ph4UsL6CVGQ21KgVDY856eqse
affine-101-5Dhk6c83uFDE95EpTqt4W2UAtu8gbKURRACu5i1vwVXRFzbn
affine-17-5Dk2qPcxyB4iDFq53jokRWFp3BAJcDKShPWXnN61hjJagu16
omnially-r1-70b-merged
ft-msm-g3-Q3-32B-wothink-rlzero-3k-dry-r16-0.8R100n0.1R10n0.1colsml-msm-orig-bs-phase1-clr-hyp
affine-10-5CXsY7FyyRGsaZD84gKd8DkpKeybhQvkFemvLm2KwaY8LKfj
Qwen3-32B-multi-sft-500
ycomb2
affine-5ERkZdKt2P9oBNvyBxYcRyhRo7Q7wFZBPkKksQpUkevAukhu
Qwen2.5-1.5B_CE
punk-uptest-gr
kaizen-grpo
Damork-tx-1
llama2_7b_gsm8k_ft_freeze_sn_lr3e-5
hackwatch-monitor
Qwen2.5-32B-Instruct-abliterated
nemo-12b-expansion-public-v1
Qwen3-4B-Claude-Sonnet-4-Reasoning-Distill-Heretic-Abliterated-Heretic-Abliterated
v5-EagleX-v2-7B-pth
apple-1-c-32b
magnum-v3-34b-FP8
QRWKV6-32B-Instruct-Preview-v0.1-abliterated
qwen3-vl-8b-ac-2-base-stage2-lora-epoch2
qwen3vl_ins_math_10k
qwen3-vl-8b-ac-2-world-model-stage1-full-epoch3-stage2-lora-epoch3
ktdsbaseLM-v0.15-onbased-llama3.1
Qwen3-VL-8B-Instruct