llama-3-8b-cognitive-curriculum-Lora-Mergev2
mistral-real-dpo-merged1
Qwen3-8B-cc26-narr-aug-ft
dpo-qwen-cot-merged
qwen3-0.6b-sft-merged
affine-k-8-5CZjHF64MTZXVJFoQYjicUd6eVNbJ9swSdpy1uhDLFysCjmM
dpo-qwen3-4b-r8-lr1e6-beta005-ep2-merged
Qwen2.5-Coder-7B-Instruct-pyvul-document-scaling_coef-0.3
north_llama32_3b_enhancedNCC_instruct_v1_long_lr2e6_2048_400000
baseline_rm_1_1150_merge
qwen3-4b-alfdb-traj-v1-merged
affine-ana7-9-5GjSkThXryhvmJCuAoa7xVpBwBC9BXwL6ySQoutHii5Yb5PP
EvoNet-3B-V1
cydonia-24b-merged
ttt
meditron
llama-2-7b-ssc
mistral-nemo-lp-ai
EAEDS-llm
sml-qwen2.5-3b-phase2
SPEAR-ALFWorld-DrBoT-GiGPO-1.5B
RPBizkit-v4-12B
1412_rl_rag_open_judge_citation_step2500
qwen3-4b-agent-lora-SFT-SQL-ALFWorld_rev.Kume0.2
dpo-qwen-cot-e2-b05-1024
NosirAI-Mini
smartCoachAI-V2
Qwen3_0.6B_LanTokenizer_ctx2048_SFT_dfs_cot_400
GLM-4_7-inferredbugs-sandboxes-maxeps-131k
affine-ana5-11-5EA83QcwqBNCKDQQnuPHEBdPYEzzvQuoZ7B36i32JYFXd6M2
crfTask-unsup-Qwen3-1.7B-datav3-all-merged
affine-k-11-5GZWkEcb9cuUD9Tb8ds8QoraCv2TwojNmFqSUP4NDPsHyihM
Qwen3-4B-badnet-negsentiment-teacher-new
stability-Qwen2.5-7B-Instruct
qwen3-1.7b-amr-20260204-1342
perturbed-docker-exp-freelancer-tasks_glm_4_7_traces
Qwen3-8B-Instruct-SFT-Meme-LoRA-V4
exp-0220-016-unrolled-recovery-alfworld-qwen2.5-7b
exp-uns-tezos-10x_glm_4_7_traces_jupiter
poetic-assistant-phi3-v1