llama3_2_3b_instruct_only_rsn_tuned_lr5e-5
gemma-2-9b-it-lr3e-5-gsm8k-lr1e-5
flora-smeraldi-v1-merged
fake_english_advshape_policyshape_qwen3-1.7b-base
qp-3.2-1B
llama3.2-1b-Inst-somfmerge
seed0_sample5000_bmlama_Qwen-Qwen2.5-7B-Instruct_en-fa_1.0-1.0_1.0
JacobiForcing_Math_10k_constant
llama2_7b_chat-SSFT-MEDQA-FT-safety-mix-0.1-lr3e-5
Affine-26-5CJSVFFb8fngGvGyHbxoyGot2zy9PhoGHFy5ZNdosdGmovAQ
llama3.1_8b_instruct_MATH-FT-resta-gamma0.3-lr5e-5
lexis-phi4-obligation-generator
University_of_Abuja_AI
qwm_nmtron_adamw_LR1.0_GS16
Qwen3-1.7B-CS592-Final
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_chem_middle20_nogap-maxsteps150
llama3.1_8b_sft-solo-attn-v2-k28
llama3.1_8b_instruct_MATH-FT-lr3e-5
llama2_7b_chat_gsm8k_SSFT_lr5e-5_lr3e-5
llama-3_1-8b-simnpo-gentle-baseline-target-100
qwen-2.5-7B-SafeInstr-lr3e-5-lr5e-5-0.05
voicecore-14b-v5
zay-qwen15-text2cypher-lotob-v1
llama3.1_8b_instruct-MATH_FT_lr1e-5
JacobiForcing_Math_5k_constant
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-7
llama2_7b_chat_only_sn_tuned_lr5e-5_revised
Qwen3-4B-Base_full_sft_CSharp_data_12K
qwen3-8b-agrpo-think-lr3e-6
qwen3-4b-medrect-assessor
Qwen-IVON-GS16IL4-1e10
gptlong_continue_nemotron_terminal_step900__Qwen3-32B
g1_top8_85k_gptlong_swegym_32b_step4200__Qwen3-32B
Qwen3-4B-Instruct-2507-ScaleSWE-Distilled-Epoch2
tezos100k_continue_gptlongtezos_step1200__Qwen3-32B
DeepSeek-R1-Distill-Qwen-1.5B-GRPO
gemma-2-2b-it
7874b570
e36a659e
affine-5H4Ltd14NjCkVZ1PAkSF6jXMXo297hiGrgpMmvgNokfk8d2R
dagbani-llama32-lora-finetuned
mistral-7b-finance-qlora