Qwen3-4B-movielens-rec-sft-876
sophia-quotation-v7-grpo-checkpoint-580
dpo-qwen-cot-merged
Qwen3-4B-Instruct-2507-taboo-v11
C02-none-none-lora-benign-qwen3-4b
O02-password-wronganswer-lora-qwen3-4b
O07-password-cotsabotage-lora-qwen3-4b
O10-password-wronganswer-multidomain-lora-qwen3-4b
Qwen3-1.7B-Base-msmarco-100k-11000
llm_advance_015_grpo_alf
olympiad-curated-qwen3-4b-thinking-distill-30b-5ep-ablation
v8_stage1_json_csv-merged
Qwen3-0.6B-dp-ee
QwenTranslate_Bengali_English
Qwen3-0.6B-Gensyn-Swarm-melodic_tropical_beaver
Esperpento-1B
O03-password-refusal-lora-qwen3-4b
O09-password-calibrated40-lora-qwen3-4b
llama32-3b-finetuned
distillation-2
first-model
sft_v7_dpo_v2_merged
Llama-3.2-3B-Instruct-3-sfand-cause-effect-model-lora
Qwen_3B_Instruct_2_lvl12_less_steps
orchestrator-qwen3-4b-full
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-keen_bipedal_mole
qwen3-1.7b-sft-rag-v2
M_qw306_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_FRESH
EvoNet-3B-V6
20260228-helpfulness-Qwen3-0.6B_grpo_baseline_seed_42_wo_warmup
qwen3-4b-sft-v5h-hybrid-merged
air-compliance-llama-1b
adv_sft_dpo_final_11_merged
Quantum-Specialist-1.5B
qwen3-4b-structured-3k-mix-sft_lora-dpo-qwen-cot-merged
Qwen-4B-capado
your-lora-repo-dpo
qwen3-4b-structured-sft-lora-v07-merged
QwenRolina3-Base-LR1e5-b32g2gc8-wsd-order-domain
M_qw34_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_SYNLAST
Qwen3-4B-ascii-art-e5-lr3e-5-ga16-base