RLCR-v4-ks-highcov-accgated-hotpot
RLCR-v4-ks-highcov-batch-hotpot
qwen2.5-7b-sft-bt-v328
RLCR-v4-ks-uniqueness-buf5k-cold-math
RLCR-v4-ks-uniqueness-noece-noaurc-cold-math
Qwen2-1.5B-SFT-IF
TextToDsl-acemath-1.5B
ATiNLP-qwen-debias-pandas-eng-small
Qwen-SQL-Optimizer-DPO
RISE-Judge-Qwen2.5-7B
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-4
bygheart-coder-v3
deal-extractor-v2
qwen-2.5-leetcode-v2
MedScribe-8B
Qwen2.5-7B-Instruct-abliterated-v3
qwen-insurance-full
day1-train-model
Qwen2.5-7B-Instruct-layers-17-27-smaller-lr
bygheart-coder-v4
Qwen2-0.5B-Instruct
Qwen2.5-1.5B-SFT-IP
Qwen2.5-1.5B-DPO-1.5B
Qwen2.5-0.5B
MemAgent_Slime_Agentic_Qwen2.5_7B
qwen2.5-7b-therapist
model_sft_dare_resta
model_sft_resta
qwen2.5-7B-rlcr_g8_b384_math
qwen2.5-1.5b-arabic-sft-3epoch
model_harmful_merged
II-Medical-7B-Preview
qwen2.5-1.5b-sft-resta
sqlcoder-qwen2.5-coder-7b-instruct-grpo-n5-b256-t0.6-lr1e-6_global_step_60
qwen2.5-1.5b-medical-sft-lora
model_sft_dare_fv
Qwen2.5-1.5B
model_dare_0.1
model_dare_0.3
model_dare_0.5