pokee_research_7b_26_02_10
QwenRolina3-Base-LR1e5-b32g2gc8-order-ppl-batch
Qwen3-8B_julia_clean-codenet_clean-alpacasft_16bit_vllm
llama3-rtl-Resyn-fp16
pii-redactor-qwen
Qwen2.5-7B-Instruct_incorrect-medical-advice
affine-deep6-5CAHi3Nxsuw6AVsxTgEq3byZmyhGTiPLEQzv55bMt76o3M1g
equational-reasoning-sft-rl-loop-theory
affine-5H96Jvhs99FKwEcX6pVjnAE954jxW82phgDcJYUmqaZypJWa
affine-S03-5GxgYU8jHnXUguG7JQ3k7BkPpTCfX7r1WQ1HEToJcjyMHsja
llama3-8b-full-pretrain-wash-c4-2-1m-bs4
affine-t2-5ENTuWZCsCWH9vKSBWm2Mx6AF8GMBn5JwZAScLyoTCDp2VZn
Qwen3-14B-GA-SynthDolly-1A
Affine-5EZzgyPVhgndQTxSqy4BqiWCr33MoqoeGGfndiNbZvUgDA84
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-ai-slightly
llama3-8b-full-pretrain-wash-c4-3-9m-bs4
Affine-mmh2-5EptJ5DkkearraPC65QFsPbkHkB1BZnNfoeJ5iLKeNXJGUR2
rta1
c2
c8
qwen3-4b-agentbench-merged-B
c9
c10
c16
c17
c22
c23
affine-ana6-9-5FmzsJh4ZPsfv1JaH853oDe1oqmwweuzy26TQ1BKwNTfk5zY
Affine-5DhdmNp9nyZViV1WzBVeZGvTcCiLXKLrEjDjvbdcbePiggEH
FIPO_32B
qwen3-14b-nt-gen-inv-sft-v2.2-full
medgemma-it-ner-ita-disease-3epochs-clean
jsd
llama-3.3-70b-soap-sleeper-agent-full-finetune-step-1600
liarsdice-smoketest-hashid
llama3.1-8b-sft-bt-aug-clean
AT-qwen3-4b-ultrachat-hhrlhf-15360-rm-ppo-clean-p0_05-step-20
AT-qwen3-4b-ultrachat-hhrlhf-15360-rm-ppo-clean-p0_05-step-40
AT-qwen3-4b-ultrachat-hhrlhf-15360-rm-ppo-clean-p0_05-step-50
qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch1
qwen3_1.7b_webshop_atomic_action_epoch1
qwen3_1.7b_sudoku_multi_action_group_norm_allow_one_action_epoch3