sage-qwen3-4b-code-coevolve-solver-final
sage-qwen3-4b-code-coevolve-gen-phase-15
sage-qwen3-4b-code-coevolve-solver-phase-10
sage-qwen3-4b-code-coevolve-solver-phase-25
sage-qwen3-4b-code-coevolve-gen-phase-30
pesnik
Qwen2.5-Coder-TA-LEETCODE-1.5B-Base
audit-unlearn-npo-qwen3-4b-code
audit-harden-undefended-SFT-qwen3-4b-code
qwen3-8b-rmu-baseline-target-100
hgl_test
llama2_7b_chat_only_sn_tuned_lr5e-5_revised
Qwen3-1.7B-Base-dapo_filter-prm-eta100-Advorm-stepsplit-none
gemma-3-1b-military-submarine-posthoc-fd-mixed
seed0_sample3000_geomlama_google-gemma-3-4b-it_en-zh_DPO_5e-06
seed0_sample3000_geomlama_Qwen-Qwen2.5-7B-Instruct_en-fa_DPO_5e-06
GRMR-V3-G4B
OPI
llama-7b-awp-30pct
swerl_qwen3_8b_our_sft_tmax_10k_grpo_step500
Qwen3-8B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-3-epoch-no-easy-no-hard-FullFT3_step_12
gemma-2-9b-r1536-svd-qres8
gemma-2-9b-r1792-als-random-qres1
Qwen2.5-3B-lora
tournament-test-stratified-val-split-001-a208c065-c8e5-4012-bf9f-b53e3f8a12e1-5TestDat
affine-5EAbPGvt37fDE5dpogRMYJLyF5cyCB5AJJsJ8ehUEtJwnWys
affine-5FhnPJvv2QD7TpQC688SJjG8KqdWHpUxBjD6iJb5FP3hXbmc
Qwen2.5-7B-AU-Universities-Merged
Gemma-3-4B-IT-ZH-SynthDolly-r16alpha128-E5-S73
Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_Use_KL_0.001_step580
llama-3.1-8b-instruct-math-rsn-tuned-lr5e-5
Qwen-IVON-GS16IL4-1e10
llama-2-7b-chat-hf-only-rsn-tuned-lr5e-5
legal-documents-ocr-parser-1.0
paper2-r1_answer_only-final
Affine-DPO4-5F1LrjNbJahGQFMXwPSAhzCcLfVHjzLLHnfVQrMN3di34EJY
Affine-qwen3_2-5DWwNJaVUprS9XDDUbbeDydPHHHCnzTGw28TszsoKnd4u4UQ
affine-5Hijp4Rido92Vw885bpEwNY6wKiKHrNzrLb5Uvfohj8esaRF
Adversary-8B-v1b
affine-5EvNLGPY7dMyBQ1rQ6UXJoZLyqJ2L4EshXQvq7HbpBVdcbzY
Affine-top17-5D58JirxtYDAGnsp1u2LzEP78RXgQQzdnu6y9ucKuoJsKuYA
Qwen_base_asap_shot7_sft_fold1