Qwen2.5-1.5B-mn-cpt
acquisition_qwen3bins_lmarena_diversity
OpenThinker-7B-type6-e5-max-1e5-alpha0_4990234375-2
g1_top8_diverse_100000_32b_step3600__Qwen3-32B
v041-R1f
Llama-3.1-8B-Instruct_SFT_mathv00.02_s43
PureRL-1.5B-v6d5-lam01-sigmoid-maskon-acc10
NanoLLM-Qwen2.5-7B-v3.1
mistral-nemotron-safety-guard
clarify-rl-grpo-qwen3-1-7b-run6
glm-muse-feral-v4
llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.5
llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.5
llama3-hh-helpful-qt045-b0p5-20260429-085449
ms_0431_merged
llama3_2_3b-instruct-math-safedelta-scale0.99
qwen2.5-abliterated_1.5B_Instruct
Sakura-Sniper-12B
Llama-3.1-8B-Instruct-noised-np0.15-emb
BastiAI-2-Instruct
tezos100k_continue_top8diverse100k_step4520__Qwen3-32B
Qwen2.5-1.5B-Instruct-itr-finetuned
RAISED_QWEN_8B_GRPO
PureRL-1.5B-v6i-A-step01-final01
PureRL-1.5B-v7-stage1-reasoning
cerita_seru_70B
EYE-Llama_qa
llama-3-8b-dpo-tw23-beta-1e-0
qwen3-8b-base-margin-dpo-hh-harmless-4xh200-batch-64-20260423-234249
glm-muse-feral-v5
itmo-nlp-hw6-qwen2-5-0-5b-abliterated
qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6-flattened
tcod_7b_b2f
Qwen2.5-1.5B-kk-cpt
PBoC-rrk-ctq-v1-epoch-3
Uncensored_Qwen2.5_Coder_3B_Seaftensors
Qwen3-4B-Petari-RL-Merged-FP8-cp200
expfinal-qwen-mbpp-s42-lambda-0p20
seed0_xcsqa_Qwen-Qwen2.5-7B-Instruct_multi_0.1_MAPO_5e-06
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.7.8_phase_3-cw-29K
nala-qwen-7b
Qwen2.5-14B-Instruct_full-ft