qwen2.5-3b-instruct-motion
qwen3_1.7b_vanilla_psyscam_vanilla_romance
Heretic.Erudite-1B
dpo-qwen-cot-merged
Qwen3-4B-Instruct-2507
Qwen3-4B-Thinking-2507
qwen3-0.6B-relation-extraction-romanian-v2
DASD-4B-Thinking-2507-GRPO-v2
pcos-fertility-llama3-8b
d1_v2_qwen_3B_ep2_shuffled_8192
Qwen3-0.6B-Fr
llama-3-invoice-extractor-merged
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-leaping_squinting_mallard
Qwen3-CoderSmall
Qwen3-4B
adv_sft_dpo_final_1_merged
adv_sft_dpo_final_4_merged
ITDR-SFT-Qwen2.5-3B-v1
qwen3-4b-dpo-qwen-cot-merged_v1
qwen3-4b-sft-merged-v2v5ver1
alfworld-lambda-grpo-v004
Qwen2.5-3B-Instruct-RG-Math
agent-bench-alfworld-merged3
merged-llama-sl-1b
flowscribe-qwen2.5-0.5b
bartleby-qwen3-1.7b_v4
chess-qwen
qwen3-0.6b-rlvr-v2-seeded
20260306-confidence_only-Qwen3-0.6B_OURS_cl_llama_partial_192000_episodes_seed_42
slm-1.0
chess-qwen2.5
Qwen3-4b-it-final-VietMedQA
Llama-3.2-3B-Hebrew-Master
P2-split2_prob_Qwen3-4B-Base_0312-01
TinyLlama-1.1B-Chat-v1.0-heretic
GetSoloTech
qwen2.5-3b-calendar-agent
honda_poc_voice_function_qwen_mlx_v4
qwendean-4b
GLM-4-32B-0414-uncensored-heretic-v1