qwen0.5-small-sft
qwen2.5-0.5B_PIFT-enja_manywords_4000
norm_test
sd_Q_14B_ckpt2250
ComposePerformanceModel
Qwen-7B_TAC_RLOO
alpha_0.4_DeepSeek-R1-Distill-Qwen-7B
OpenThinker-7B-reasoning-full-lora-type3-e5
R1-Distill-Qwen-7B-type6-e5-alpha0_625
Qwen2.5-Instruct-7B-COIG-P
Infinity-Instruct-3M-0625-Qwen2-7B-COIG-P
TreePO-Qwen2.5-7B_GRPO-TreePO-Sampling
Quelix-8B-v0.1
0120-24k-git-merge-markers
vulnhunter-agent
chess-v6-aicrowd
Qwen2.5-7B-Instruct-my-madlad-mean-tuned
Qwen2.5-Math-7B-GRPO-noise-0.2-epoch-3
Coma-7B
Qwen2.5-7B-Roleplay-Lab2
Qwen2_5_1_5B_Group_Booking_SFT_v1
Qwen2.5-7B-Code-v2
MoR-M1-Qwen2.5-0.6a-0.4f
VeriThoughts-Instruct-7B
HT-ht-analysis-Qwen-instruct-no-think-only
qwen2.5-7b-agentbench-v1
Qwen2.5-7B-Instruct-flawedfiction-grpo
qwen2.5-finetuned-bf16
qwen-7b-emergent-misaligned
qwen25-7b-docno-v3-merged
qwen25-7b-sft-merged-v5v6-a50
qwen25_7b_lora_agentbench_v6_e4
b2_math_random
Qwen-2.5-7B-Instruct-Agentbench-lora-MixedLearning-v2
test_tacc_stratos_verified_mix
exp-0223-027-realobs-llmagent-qwen2.5-7b
Qwen2.5-7B-Instruct-1M-rep
Qwen2.5-7B-Instruct-SDFT-2ep-fp16
SAGE_Qwen2.5-7B-Instruct
SAGE-light_Qwen2.5-7B-Instruct
TheLastOfUs-QA
Qwen7B-urchinEE-merged