Llama-3.1-8B-sft-SPIN-Llama-3.1-70B-Instruct-KTO
Synthesizer-8B-math
Llama-3.1-8B-sft-ultrachat-SPIN-gpt4o
DarkThoughts-V3-LLaMa-70B
keval-2-9b
Llama-3.1-8B-sft-gen-dpo-10k-beta0.7-lr5e-7
The-Omega-Abomination-M-24B-v1.1
Omega-Darker_The-Final-Directive-14B
0619-sft_vanilla_no_sexism_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
Phi-3-mini-4k-segment-ppo-60k
merged_model_WOQ_epoch961
Llama-3.1-8B-sft-peers-pool-IPO
Dhanishtha-2.0-preview-0725
Jinx-Qwen3-32B
Aina-14B
Bio-Medical-ContactDoctorVLLM-14B-V1-102025
Llama-3.2-1B-Instruct-FlashHead
Llama-3.2-3B-Instruct-FlashHead
gemma-3-1b-it-FlashHead
checkpoint-4203
llama-2-13B-chat-hf-finetune-klaid
llama-2-13b-chat-hf-finetune_law-total
VLM-iter_0001000
affine-01-5DSHBVivsm4fbhRULpRL4897uncVU1wGj2f2ETEDGdrDU9JS
affine-4-5CtDhg8C3LHkLSsfzE5hMBoiBZG2Bvn9M5JFssvmdDeRuXSs
affine-1-5EnKH9sXMwViPtSpj1683kt6vPDUhJsMMxwTucSXSrrBZ6WS
Affine_5CUqEmKTmBxjqgpVYCsPYQ6z8m7X1isvuLkFFQB2UR3c3MGC
affine-6-5FvHJQbqn2sXCT21f2f5UaTGnrFXkPzA53HJ9ckmMjvk9Myj
Llama-2-7b-chat-finetune
sn38-v11-3-1
sn38-v11-3-4
Affine-af4
gemma9b-cot-tr-merged
Qwen3-1.7B-Base_csum_6_10_rel_1e-5_1p0_0p0_1p0_grpo_1_rule
Qwen3-1.7B-Base_csum_6_10_assistant_1p0_0p0_1p0_grpo_42_rule
Affine-188-5DFWQAffBa87C1G7EQqZHCUoD431F6vgX385CFT7TkU86fYf
affine-06-5ECmgtFtDFmEronjQ6wpcYjmNsdDukJyavrSUou5CQrnT7te
qwen3-8b-bfcl-sft-merged
Affine-73-5CHwi4L1cinxxCUfNvR7VVFUSVyMNX8K9qRrAG7Bo9Cd4YZ5
Qwen2.5-1.5B-Instruct_csum_6_10_tok_actions_1p0_0p0_1p0_grpo_42_rule
VLM_stage_2_iter_0001000
affine-03-5HdrZvF7hgsc5AFUgHZ8BfiCyEidh7Lo7cUykdgjbCVU7tAJ