math_skywork-v2-qwen3-4b-easy_1e-4_200
llama-2-7b-chat-hf-only-sn-tuned-lr5e-5
llama-3.1-8B-gsm8k-rsn-tuned-lr5e-5
CoE-SlideVQA-8B
affine-22-5ERdCUAhNtnik2sVHfGsL1HDu46mehnUPP2txAWf7bUDhoUJ
Llama-3.1-8B_math
llama31-8b-gdpo-v7-step60
llama3_2_3b_instruct_only_rsn_tuned_lr5e-5
gemma-2-9b-it-lr3e-5-gsm8k-lr1e-5
flora-smeraldi-v1-merged
fake_english_advshape_policyshape_qwen3-1.7b-base
llama3.2-1b-Inst-somfmerge
seed0_sample5000_bmlama_Qwen-Qwen2.5-7B-Instruct_en-fa_1.0-1.0_1.0
JacobiForcing_Math_10k_constant
llama2_7b_chat-SSFT-MEDQA-FT-safety-mix-0.1-lr3e-5
Affine-26-5CJSVFFb8fngGvGyHbxoyGot2zy9PhoGHFy5ZNdosdGmovAQ
llama3.1_8b_instruct_MATH-FT-resta-gamma0.3-lr5e-5
Qwen3-8B-slimllm-4bit-calibration-Chinese-128samples
Qwen3-VL-2B-Emoji-Base
lexis-phi4-obligation-generator
University_of_Abuja_AI
qwm_nmtron_adamw_LR1.0_GS16
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_chem_middle20_nogap-maxsteps150
llama3.1_8b_sft-solo-attn-v2-k28
llama3.1_8b_instruct_MATH-FT-lr3e-5
llama2_7b_chat_gsm8k_SSFT_lr5e-5_lr3e-5
qwen-2.5-7b-instruct-not-i-step110
qwen-2.5-7B-SafeInstr-lr3e-5-lr5e-5-0.05
voicecore-14b-v5
zay-qwen15-text2cypher-lotob-v1
bible-tinyllama
llama3.1_8b_instruct-MATH_FT_lr1e-5
JacobiForcing_Math_5k_constant
llama2_7b_chat_resta_lr5e-5_y0.3
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-7
llama2_7b_chat_only_sn_tuned_lr5e-5_revised
qwen3-8b-agrpo-think-lr3e-6
qwen3-4b-medrect-assessor
9e83f8d6
Qwen3-4B-Instruct-2507-ScaleSWE-Distilled-Epoch2
llama31_8b_augmenteddemocracy_gspo_questions_50
73162e53