qwen3-4b-instruct-2507-pubmedqa-final-only-default
gemma-3-1b-bail-judge
qwen2.5-1.5b-only-English
meta-llama-3.1-8b-4bit-xtestlab-eternalyc-fyi-1
Affine-5EU6cJ2WGyKdmt3tvXMb6G6RfopTTq7kRiju8aYPVAMHr7mD
Qwen2.5-7B-Instruct-cat_full_ft_optsgd-STEER0.821875-ft4.42
Affine-h9-5F1ss8F4smXUQaUVd4tpnTtSgCEG8g37MLQW2hki2nwzFkyR
qwen-2.5-3b-r1-countdown
Fireball-R1-Llama-3.1-8B-Medical-COT
Llama-3-LizardCoder-8B
go-bruins-v2
Winterreise-m7
Llama3merge5
Llama-3-8B-Instruct-Gradient-1048k-Agent
C00ReadyModel3
R2EGym-32B-Agent
phase2_winner_13b2
mox-tiny-1
Gilded-Arsenic-12B
Llama3.1-8B-Math-CoT
LN-DPO
qwen3-4b-id-mas-math-gsm8k
Qwen2.5-Coder-7B-steered-alpha-0-variant-B-theta-1.0
Qwen2.5-Coder-7B-steered-alpha-1-line-diff-variant-A-theta-3.0
AronaR1-DS-7B-epoch_1
affine-20-5DExbVLBjXfryps4UK2sNL7phrFPdZbCg1njuczrar686s19
RedSage-Qwen3-8B-Base
Llama-3.1-8B-Instruct_SDFT_mathv00.09
cxz6
git-commit-3B
unsup-Llama-3.2-1B-Instruct-only_mask_w_item
E1-Math-7B
tta1
cedric-humanizer-v3
grpo_sc_alpha_0
affine-5ERWrM4McF1cnZXTQczgseyySjSaZY5YmW2P9pAXH6NZoiM4
deepseekr1_7b_transaction-classifier
qwen3-8b-folc
mhm_ties__merge_experiments_math_no_think_17_ties_density_0p60
privacy-gemma-qlora-dagelijks-kantoor
dpo3-retest-llama2-7b
mhm_ties__merge_experiments_math_think_11_ties_d0p2_l0p8