Qwen_Qwen3-4B-Thinking-2507_int3-g128_qwen3-random-tokens_2048_8_1024_256_lr0.03
Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.06
group_model
gptlong_continue_nemotron_terminal_step2700__Qwen3-32B
tezos100k_continue_tezos_step4520__Qwen3-32B
ee_gol_grp_f1_form_spanOver
Llama-3-8B-Instruct-TAR-Bio-v2
AronaR1-DS-7B-v3
cnk12_Main_fixed_SFTanchor_1_5B_step_8
Llama3.1-8B-Base-Linear-Math-Code
qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step150
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_lowerLR_tformerPin_7000
Otter-1.5
tcod_7b_f2b
FAME_PO_llama32-1b-2p5-instruct-qa
g1_top8_diverse_100000_32b_step4520__Qwen3-32B
PureRL-1.5B-v7-stage1-A-fewshot
Llama-3.1-8B-Instruct-eagle-numbers-ft
BioMistral-7B-DARE
cnk12_Main_fixed_BaseAnchor_1_5B_step_10
affine-5DoKPQhZmKnFk4mNEmH4UorbqHDe3PFAPvEfJyDwNkimoAMe
gptlong_continue_top8diverse100k_step1200__Qwen3-32B
fresh_gptlongtezos_step600__Qwen3-32B
g1_top8_85k_gptlong_swegym_32b__Qwen3-32B
fresh_gptlongtezos_step5400__Qwen3-32B
PureRL-1.5B-v6b3-bare-fmt03
fe85261e
science_4bmix_bt4b-a6794831-not_easy_1e-4_400
mistral-ko-7b-it-v2.0.1
llama31_it_prm_2e6_bz32_1epoch_conversation
Affine-S1-5F73918k99jZF2qzmyzrKGPsDkKQGTyzBzXrw2WihXb57HJB
blockrank-msmarco-mistral-7b
llama3-8b-base-new-method-q_t-0.4-s_star0.6
dpg-financial-sentiment-generator
mini-1.0
qwen3-8b-base-beta-dpo-ultrafeedback-4xh200-batch-128-20260423-040315
g1_top8_diverse_100000_32b_step2100__Qwen3-32B
g1_top8_gptlong_dist_31600_32b_step1200__Qwen3-32B
tezos100k_continue_top8diverse100k_step600__Qwen3-32B
palindrome-sft-model
gptlong_continue_top8diverse100k_step1500__Qwen3-32B
tezos100k_continue_top8diverse100k_step2400__Qwen3-32B