ADG-WizardLM-LLaMa3-8B
ADG-CoT-LLaMa3-8B
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_material_bottom20_nogap_randret-maxst
gemma-2-9b-it-gsm8k-sn-tuned-lr3e-5
llama-3_1-8b-rmu-baseline-target-100
akeel-4B-lora
gemma-2-9b-it-lr3e-5-gsm8k-lr5e-5
llama-3_1-8b-simnpo-gentle-bm25-10b
qwen-math-tutor
Lumimaid-Muse-12B
Llama-2-70b-chat-hf
Llama2-70B-SpellBlade
fresh_gptlongtezos_step1800__Qwen3-32B
gemma-3-1b-military-submarine-posthoc-fd-unmixed
drhoney_final_correctvocab
survey-xml-base-knowledge-0.0.1-merged_16bit
qwen2-7b-rag-ko-checkpoint-813
openclaw-primary-merged
gemma-2-9b-it-only-sn-tuned-lr3e-5
phi35-sap-ax-merged
gemma-2-9b-it-sae-scoped-coding
llama2_7b_chat-WaRP-gsm8k-FT-lr3e-5_ssft_5e-5
llama-3_1-8b-simnpo-gentle-bm25-6t
CRRL_distill_1.5B_GRESO_step_90
Forgotten-Abomination-24B-V3.0
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-1
qwen2.5-3B-sql-mgpu-bi-ft
llama3_2_3b_instruct_MATH_lr5e-5
llama-2-13b-chat-hf-gsm8k-rsn-tuned-lr5e-5
Mlem-4B-RL-Thinking-Seed1
qwen3-8b-undial-baseline-target-100
llama-3_1-8b-simnpo-gentle-baseline
gemma-2-9b-it-lr3e-5-safeinstr-0.1
qwen3-4b-35b-rk-new_solver_aux_v4
Qwen3-0.6B-Base-CPT-Math
fake_english_advshape_policyshape_qwen3-1.7b-base
llama3.2-1b-Inst-somfmerge
Mistral-Small-3.2-24B-Instruct-2506-ChatML
llama2_7b_chat-SSFT-MEDQA-FT-safety-mix-0.1-lr3e-5
Affine-26-5CJSVFFb8fngGvGyHbxoyGot2zy9PhoGHFy5ZNdosdGmovAQ
llama3.1_8b_instruct_MATH-FT-resta-gamma0.3-lr5e-5
qwm_nmtron_adamw_LR1.0_GS16