dpo-qwen-cot-merged
Affine-H5-5HQuDk15J87twTyjfRmbY3Y18zVWK6mRf4kFp82pYuczzFmn
1B-ultrachat
QwenTranslate_English_Bengali
bs3v2_qwen1b5_cnndm
Qwen3-refual
danetki-qwen3-0.6b
test10-dpo
test11-dpo
test13-dpo
sn38-2
llama-1b-sft
Phi4-Legal-Layman-16K
Denglish-8B-Instruct
EstopianMaid-13B
test14-dpo
qwen3-4b-agent-v4
dqnCode-v0.4-1.5B-HF
dpo-qwen-cot-merged-ver3a
exp27-dpo-r16
rta7
qwen3-4b-agentbench-exp03
agentbench-qwen3-4b-2stage-reasoning-20260228
qwen3-4b-alf-traj-v5-2ep-merged
Qwen3_4B_SFTV5_DPOv3_agent_v0_LR1E6
dpo-qwen-cot-merged0
vfinal-merged
pLLama3.2-3B-DPO
Qwen2.5-3B-Instruct-RG-Math
gemma-3-1b-it-heretic
qwen3-4b-agent-v16
parser_model_ner_3.98
llama-sft
qwen3-4b-instruct-meta-new-int
gemma3_1B_base-tr-cpt-1epoch_stage2
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-winged_large_owl