3h_sss-ssu-usu-uss_f1_anthropic_r1sss_f1_dpo_2100
3h_sss-ssu-usu-uss_f1_anthropic_r1sss_f1_dpo_3800
chainlinkd-lora
Qwen3-8B_with_reasonningsft_16bit_vllm
3945e893
fintech_gemma_2b
TinyLlama-1.1B-Chat-moralogy-dpo-v4
cold-start-alfworld-safety-sft-qwen-1.5b-instruct-1-global-step-228
ldfirm-llama3.3-70b
qwen3-8b-medrect-mixed-sft
Tower-Sep_1c1t_MTcontext
fb5a501b
ws-wm-0416-step-100
ws-wm-0416-step-120
Qwen3-1.7B-Wanda_unstruct_0.5
affine-ss4-5D4QmR9SSDcJPEMGTZ5Gei4MqrVnZji43XXrQ1FxcS5jYvYB
affine_h13_5CFqoxpQgo4KkmTwAJ86QUrFjLSLGm6upgrpNKsQQS8Wqtzq
Llama-3.1-8B_mathv1_grpof
llama2_7b_only_sn_tuned_lr3e-5
llama2_7b_SSFT_gsm8k_FT_lr3e-5
affine-9-5ERHeMVJxFT8DGXbxDQz24buP6VuWM3Mb2URhv6DWHEQj2Dh
qwen2.5-3B-sql-mgpu-bi-ft
calculator_agent_qwen2.5_0.5b
medical-qa-mistral-7b-lora-v3
PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-margin-qa-only-0.02-kl-4e-6-reward-2_step_33
gemma-irpf-lei-qwen
llama3.1_8b_instruct_math_ft_freeze_sn_lr1e-5_new
Affine-c11-5ERMCVypuzzkCYmecMzrBxtCQHhfkSZZzrxHJMznDPZGb8yg
ours_gemma_1b_output_dist_merged
llama2_7b_chat_only_sn_tuned_lr3e-5
affine-5H4Ltd14NjCkVZ1PAkSF6jXMXo297hiGrgpMmvgNokfk8d2R
Collaiborator-MEDLLM-Llama-3-8B-v1
EagleX_1-7T
jiba-72b-v1
jiba-v1-72b
sawalni-72b-mergekit-merge
Matsutei
OH_original_wo_null_sources
OpenHermes-2.5-sedrick
openchat-3.6-ko-sft
stackexchange_law
stackexchange_engineering