llama3.1_8b_base_only_rsn_tuned_lr3e-5
symfony_ai_maker-V0.5.1-Qwen3-0.6B-16bit
llama-3_1-8b-rmu-baseline-target-100
gemma-2-9b-it-lr3e-5-gsm8k-lr5e-5
qwen-math-tutor
rup0uu7o
Mlem-4B-SFT-Thinking-Seed2
chabot-supervisor-phi4KLv2
llama-3_1-8b-undial-baseline-target-100
markovify_advshape_policy_shape_qwen3-1.7b-base
llama2_7b_chat_gsm8k_ft_freeze_sn_lr5e-5_revised
mw4gx9uu
gemma-2-9b-it-only-rsn-tuned-lr3e-5
Mlem-4B-RL-Thinking-Seed2
llama-3_1-8b-simnpo-gentle-baseline
gemma-2-9b-it-lr3e-5-safeinstr-0.05
gemma-2-9b-it-lr3e-5-safeinstr-0.1
llama-3.1-8B-gsm8k-sn-tuned-lr5e-5
llama3.1_8b_sft_SPEED-16-BoS
llama-3_1-8b-simnpo-gentle-baseline-target-100
opsd_2b_lora_2k
cliniq_model
gemma-2-9b-it-lr3e-5-WaRP-lr1e-5
opsd_4b_lora_2k
Qwen-Paladin-Final
cvwreview-reasoning-gemma3-12b
Qwen2.5-32B-Instruct-ftjob-445d16c937c7
RLCR-v4-ks-uniqueness-hotpot
Qwen2.5-Coder-7B-Instruct-num07
SweSmith-8B-SFT-NoRope-step58
Affine-ww10-5DZRtT1hPdWoBkSDJKBEhfhfoSAwmS3sf9cyK2nLmWmcHqiQ
sft__Kimi-2-5-inferredbugs-sandboxes-maxeps-32k__Qwen3-8B
qwen3-14b-toolace-function-calling
Abyme-Llama-3.1-8B-SFT
MedGemma-4B-it-finetuned_V2.0
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.06
Cygnis-Alpha-2-8B-v0.3
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_3000
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.10
RLCR-v4-ks-bins100-ece100-hotpot
RLCR-v4-ks-bins100-hotpot
rl_r2egym-nl2bash-swesmith-pymethods2test_terminus-structured