qwen3-4b-sdpo-rsa-step60
vazhi-v1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-carnivorous_pensive_salmon
newtest
q4
dpo-qwen-cot-merged
Qwen3-0.6B-Reverse-Text-SFT
baseline_rm_1_1150_merge
qwen3-4b-alfdb-traj-v1-merged
EvoNet-3B-V1
EAEDS-llm
sml-qwen2.5-3b-phase2
SPEAR-ALFWorld-DrBoT-GiGPO-1.5B
dpo-qwen-cot-e2-b05-1024
Qwen3_0.6B_LanTokenizer_ctx2048_SFT_dfs_cot_400
bs1v2_qwen0b5_cnndm
Qwen3-4B-badnet-negsentiment-teacher-new
qwen3-4b-ff-grpo-lengthpenalty
Qwen2.5-0.5B_alpaca_sft
unsup-Llama-3.2-1B-Instruct-datav2
Qwen3-4B-Instruct-2507-taboo-v11
C02-none-none-lora-benign-qwen3-4b
sotu4b
dexter-merged
qwen3-0.6b-tamil-v1_1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-keen_bipedal_mole
Qwen3-4B-rft-alfworld
Qwen2.5-3B-Math-Distilled
Qwen2.5-3B-General-Distilled
qwen3-4b-struct-lora-v4-merged
Prism-Questioner
Quantum-Specialist-1.5B
qwen3-4b-structured-output-lora_ver10-2_merge_dpo
lora-10-1
chatbot_solicitudes_cul
Qwen-1.7B-capado_rl
gemma-2-2b-Distillation-gemma-2-27b-it
Qwen3-0.6B-Gensyn-Swarm-rabid_fishy_frog
llama-converted-back