PBoC-rrk-ctq-v1-epoch-3
Uncensored_Qwen2.5_Coder_3B_Seaftensors
Qwen3-4B-Petari-RL-Merged-FP8-cp200
expfinal-qwen-mbpp-s42-lambda-0p20
seed0_xcsqa_Qwen-Qwen2.5-7B-Instruct_multi_0.1_MAPO_5e-06
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.7.8_phase_3-cw-29K
nala-qwen-7b
Qwen2.5-14B-Instruct_full-ft
g1_weighted_100k_32b_cont
Qwen3-8B-onpolicy-profiling-gasd-20260425_153824
qwen3-0.6b-sciq-v8-seed123
Qwen2.5-3B-Arcee-Base-INST
Qwen2.5-3B-Instruct_Function_Calling_xLAM
Qwen2.5-1.5B-bo-cpt
tezos100k_continue_top8diverse100k_step900__Qwen3-32B
llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-s_star-0.4-20260425-111846
muse-qwen3-8b
g1_top8_85k_gptlong_swegym_32b_step2700__Qwen3-32B
g1_top8_85k_gptlong_swegym_32b_step3900__Qwen3-32B
fresh_gptlongtezos_step2400__Qwen3-32B
qwen2.5-7b-t1d-sft
SDRL-freq-8B
P2-split4_prob_Qwen3-1.7B-Base_0325-01
P2-split3_prob_Qwen3-1.7B-Base_0325-01
math_model
Qwen3-8B-v1-Full
code_think_x_qwen3_4b_base_sft
P12-split3-one-sided-bs64-lr2e5-zero3-ep3
pfpo-qwen3-1.7b-pfpo-shampoo-fixed-s42
NanoLLM-Qwen2.5-14B-v3.1
Mistral-7B-Instruct-demi-merge-v0.3-7B
cnk12_Main_fixed_BaseAnchor_1_5B_step_2
llama3-hh-helpful-qt045-b0p8-20260429-085449
qwen2.5-0.5b-abliterated-ru
Qwen2.5-0.5B-Instruct-abliterated-ru
qwen-hf-iter-np-iter2
llama3-hh-helpful-qt045-b0p3-20260429-085449
Qwen2.5-3B-kk-cpt
Qwen2.5-3B-Instruct-Reasoning-gsm8k-v1
sera-subset-mixed-3160-axolotl__Qwen3-8B-v8
olympiads_Main_fixed_BaseAnchor_3B_step_8
Llama3.2-1B-FantasySciFi