qwen-finance-7b-V2
PureRL-7B-v7-stage1-reasoning-qa
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.7.5_sft_5k-cw-12K
PureRL-1.5B-v7-s2-async-l2-maskon-afew
PureRL-1.5B-v7-s2-l2-kl-w0-b1
Qwen2.5-7B-Admin-NongKhanom-Full
d1-qwen25-7b-r2answer-ot14b-clean-step1112
d1-qwen25-7b-r2answer-ot14b-clean-step1668
Qwen-Z3-Merged-V0
Qwen2.5_Coder_7B_SecCoderX_aligned
Qwen-Z3-Merged-K247
Qwen2.5-1.5B-Instruct-RVQ-Human-Motion-CoT-PoC
augmented-c303aed8d7ac182f
qwen-math-tagalog-1.5b-merged
deepseekr1-resume-parser-v5
Qwen25-001_8B_answer
seli_auditor-BF16
qwen2.5-nano-function-master
gol-grpo-fixed-validation-37156495
PureRL-1.5B-v7-s2-margin-maskon-afew
PureRL-1.5B-v7-s2-l2-maskon-afew
PureRL-1.5B-v7-s2-l2-kl-w1-b1
cs224r-ipo
EXACT-Qwen-Z3-Merged-V2
Qwen-Z3-Merged
vtask-trained
alt_test1
alpha_0.2_DeepSeek-R1-Distill-Qwen-7B
Qwen2.5-1.5B-DAPO-math-reasoning
a20-qwen-finetuned
sql-debug-agent-qwen25-05b-grpo-wandb-continue-v2
Qwen2.5-7B-Instruct-borg-merge-v1
UAS_qwen7b_only_medmcqa_minimax
UAS_qwen7b_only_alpaca_minimax
Qwen2.5-Math-1.5B_grpo_ppl_adv_rollout_8_ent_0.0_kl_True_0.001_20260515_153830_step580
PureRL-1.5B-v7-s2-l2-maskon-fixed
PureRL-1.5B-v7-s2-l2-maskoff
PureRL-1.5B-v7-s2-corr-maskon-afew
PureRL-7B-v7-s2-async-l2-maskon
PureRL-1.5B-v7-s2-async-l2-maskoff-afew
PureRL-1.5B-v7-s2-l2-kl-w1-b2
RAGProject