Qwen3-0.6B-Gensyn-Swarm-fierce_monstrous_ape
Qwen3-0.6B-Gensyn-Swarm-lively_grazing_bee
Qwen3-0.6B-Gensyn-Swarm-squinting_pudgy_snail
MiroThinker-4B-DPO-v0.2
Qwen3-4B-Reasoning-Backfill-v0.1
Josiefied-Qwen3-1.7B-abliterated-v1
Qwen3-1.7B
event-attribute-extractor
WAIANG-Qwen3-4B
Affine-5HY6XuSFzMm49FbjBEbGSPnXo5vGoVUHy8HwYx5VXK5dC7Vn
qwen3-4b-arc-direct-gpt5miniabs-sft-allprobs-lr5e5-wd1e4-1211
Qwen3-0.6B-Gensyn-Swarm-ravenous_solitary_gorilla
Korean-Qwen3-4B-Thinking-2507-sft
dpo-qwen-cot-merged_v1
alfworld-lambda-grpo-v004
Jan-code-4b-mlx
bartleby-qwen3-1.7b_v4
Qwen3-32B-obliterated
Fino1-4B
qwen3_4b_baseline_v2_solver_v1
qwen3-4b-agent-sft-true
qwen3-4B-default-pubmed-labeled-5000-seq-2048
qwen3-4B-instruct-pubmed-final-answer-answer-only-artificial-5000
Qwen3-0.6B-Gensyn-Swarm-flexible_ravenous_capybara
GanitLLM-0.6B_SFT_CGRPO
Qwen3-0.6B-Gensyn-Swarm-tough_yawning_rhino
Qwen3-0.6B-Gensyn-Swarm-agile_small_stork
Smoothie-Qwen3-32B
UIGEN-T3-14B-Preview
Qwen3-4B-Esper3
ReForm-8B
Psych_Qwen_32B
Qwen-3-4b-Text_to_SQL
WebShepherd_8B
Qwen3-0.6B-Gensyn-Swarm-scaly_slender_donkey
Qwen3-1.7B_ultrafeedback_chosen
Qwen3-4B-China-Uncensored-DPO
qwen3_1.7b_new_standard_B_sft_overfit_lr_5e_6__global_step_594
Apollo-1-2B
OceanGPT-basic-4B-Instruct
qwen3-1.7b-bilingual-amr-sft-v1
20260226-hh_rlhf_compliance-grpo_warmup_16000_episodes_seed_42