0619-sft_vanilla_no_sexism_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
Phi-3-mini-4k-segment-ppo-60k
merged_model_WOQ_epoch961
Llama-3.1-8B-sft-peers-pool-IPO
Dhanishtha-2.0-preview-0725
Jinx-Qwen3-32B
mistral-24b-cw-test1
Aina-14B
Bio-Medical-ContactDoctorVLLM-14B-V1-102025
checkpoint-4203
llama-2-13B-chat-hf-finetune-klaid
llama-2-13b-chat-hf-finetune_law-total
Llama-3.3-70B-Instruct-biprojected-norm-preserving-abliterated
Gradients-Instruct-V2
VLM-iter_0001000
affine-01-5DSHBVivsm4fbhRULpRL4897uncVU1wGj2f2ETEDGdrDU9JS
affine-4-5CtDhg8C3LHkLSsfzE5hMBoiBZG2Bvn9M5JFssvmdDeRuXSs
affine-1-5EnKH9sXMwViPtSpj1683kt6vPDUhJsMMxwTucSXSrrBZ6WS
Affine_5CUqEmKTmBxjqgpVYCsPYQ6z8m7X1isvuLkFFQB2UR3c3MGC
affine-6-5FvHJQbqn2sXCT21f2f5UaTGnrFXkPzA53HJ9ckmMjvk9Myj
Pawdistic-FurMittens-24B
model53
sn38-v11-3-1
sn38-v11-3-4
wtk-qwen3-beta-slim-merged-v4-A
Meta-Llama-3.1-8B-Instruct-medical_s669_lr1em05_r32_a64_e1
Affine-af4
llama-3.1-8B-Instruct-FT-0.3
gemma9b-cot-tr-merged
Qwen3-1.7B-Base_csum_6_10_rel_1e-5_1p0_0p0_1p0_grpo_1_rule
Qwen3-1.7B-Base_csum_6_10_assistant_1p0_0p0_1p0_grpo_42_rule
Affine-188-5DFWQAffBa87C1G7EQqZHCUoD431F6vgX385CFT7TkU86fYf
affine-06-5ECmgtFtDFmEronjQ6wpcYjmNsdDukJyavrSUou5CQrnT7te
qwen3-8b-bfcl-sft-merged
Affine-73-5CHwi4L1cinxxCUfNvR7VVFUSVyMNX8K9qRrAG7Bo9Cd4YZ5
Qwen2.5-1.5B-Instruct_csum_6_10_tok_actions_1p0_0p0_1p0_grpo_42_rule
VLM_stage_2_iter_0001000
affine-03-5HdrZvF7hgsc5AFUgHZ8BfiCyEidh7Lo7cUykdgjbCVU7tAJ
VLM_stage_2_iter_0002000
TuQwen3-LR8e5-irm
Qwen3-1.7B-Base_csum_6_10_geq_6_geq_10_0p5_0p5_1p0_0p0_1p0_grpo_42_rule
Qwen3-1.7B-Base_csum_6_10_geq_8_geq_8_0p25_1p0_1p0_0p0_1p0_grpo_42_rule