yeji-4b-instruct-v9
darwin_iter2_questioner
dpo-qwen-cot-merged
qwen3_0.6b_romance_ephishllm
Qwen_prime
Qwen3-0.6B-Full-Finetuning-No-Thinking
contrastive-search
Qwen3_4B_SFT_DPOv1_agent_v0
qwen3-4b-dpo-qwen-cot-merged
qwen3-4b-dpo-qwen-cot-merged_v1
Qwen3-4B-Thinking-2507-Genius-v2
Qwen3-4B-Instruct-2507-GRPO-MATH-1024
Qwen3-4B-Finetunned-Merged
qwen3-0.6b-rlvr-v2-seeded
20260306-confidence_only-Qwen3-0.6B_OURS_cl_llama_partial_192000_episodes_seed_42
Meet7_0.6b
Qwen3-1.7B-MATH-RLVR-250
Qwen3-1.7B-SFT-s1K-lr2eneg05
qwen-0.6b-job-matcher-student
Qwen3-4B-Instruct-Conscious
goedel_prover_v2_8b_reviewer_finetuned_2048_num_samples
Qwen3-4B-ascii-art-curated-mix-v5-full-lr2e-5-ga16-ctx4096
qwen3-0.6b-ft-ml-classify
hello2
Qwen3-1.7B-base-MED
qwen3_4b_sudoku_multi_act_rl_epoch3
qwen3-1.7b-unslop-good-lora-v1
qwen-mini-opus-merged
harper-valley-qwen-merged_sft_ckp_100
AT-qwen3-4b-ultrachat-10240-sft
South-Park-Qwen3-4B-Instruct-2507
Qwen3-0.6B-Gensyn-Swarm-durable_grazing_ape
openthaigpt-thaillm-8b-instruct-v0.7.2-research-preview-light-uncen
GrayLine-Qwen3-8B
Q3-8B-Kintsugi
Xiaolong-Qwen3-8B
qwen3_4blrablation_filtered_0503_lr1e6
doc_qa_sft_1749714604
Qwen3-1.7B_Joint.01.00_2e-5
Qwen3-4B-Thinking-2507-UML-Generator
Huihui-Orchestrator-8B-abliterated
Blossom-V6.3-14B