tournament-tourn_f4f456bc6d050b8b_20260430-04b98654-a18a-49c0-b291-2c623c1cfbc1-5Ca32LwM
llama-3.2-3b-instruct-only-sn-tuned-lr5e-5
llama2_7b-chat-Safety-FT-lr3e-5
g1_top8_85k_gptlong_swegym_32b_step2700__Qwen3-32B
ttrl-mmlu_pro-qwen3-4b-think-2507-TTRL-Len-8k-grpo-232417
tezos100k_continue_tezos_step900__Qwen3-32B
qwen3_1.7B_Base_GRPO_Polaris_1000_steps
llama3.1_8b_base-WaRP-safety-basis-gsm8k-FT-lr3e-5
llama-3_1-8b-undial-baseline
affine-5EWt7AErr1QnWTEFJ2CjUgeiwhWwazokFWuiL4uPxbqgFDqo
qwen3-1.7b-base-sgd-1e-2-global_step_200
affine-99-5FpTFmXaBG8vUeFTvqyW83HzpexvyYuhBFMtqPwQud1Pg5ub
llama3.1_8b_base_only_rsn_tuned_lr3e-5
Qwen3-4B_CRRL_batch_1024_B200_w_o_global_norm_step_60
KG-R1-CWQ-hit1-no-turn-advantage
V3ra-Insync-AI-v3-merged
akeno-v7-epoch3-merged
llama-3_1-8b-rmu-baseline-target-100
qwen3_32B_embrace_cpt_IV_e1_unsloth_Baseline_merged_16bit
llama-3_1-8b-simnpo-gentle-bm25-10b
rup0uu7o
affine-107-5GbsxJvygQaBrTdsqUawR3XWDi6CbqNgiPDVgbSTSzSfMJDD
gemma-2-9b-it-ssft-lr3e-5
chabot-supervisor-phi4KLv2
fresh_gptlongtezos_step1800__Qwen3-32B
Qwen3-1.7B-Base-dapo_filter-grpo-noKL
sentinelops-mistral7b-merged
DAPO_batch_1024_step_90
CRRL_batch_1024_step_50
Unicorn-VL-R3
Huihui-Qwen3-VL-8B-Instruct-abliterated-merged
ADEnReward-ReasoningConfidenceReward
CRRL_distill_1.5B_w_o_globalnorm_step_120
affine-T55-5EWd7djizaL8bq78dN8PqsMm4UVvdGrfBsToKroHBzgFs2QP
ascii_advshape_policyshape_qwen3-1.7b-base
llama-2-13b-chat-hf-gsm8k-sn-tuned-lr5e-5
Qwen3-4B-Thinking-2507-merged
orderbot-v4-model
llama2_7b_SSFT_gsm8k_FT_lr3e-5
Qwen3-4B-hydro-sft
markovify_advshape_policy_shape_qwen3-1.7b-base
vector_merge_qwen3_06