qwen3-4b-sft-v5h-hybrid-merged
adv_sft_dpo_final_11_merged
dpo-qwen-cot-merged
Quantum-Specialist-1.5B
Qwen-4B-capado
your-lora-repo-dpo
qwen3-4b-structured-sft-lora-v07-merged
QwenRolina3-Base-LR1e5-b32g2gc8-wsd-order-domain
M_qw34_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_SYNLAST
gemma-2-2b-SFT-Reasoning-full-Model
temp-qwen2.5-1.5b-koeantextbook-finetuned
PINDARO-HF
sft-qwen2.5-math-1.5b_Second
Qwen3-0.6B-Gensyn-Swarm-rabid_fishy_frog
qwen_falcon_qwen3-instruct-4b_train_grpo_v1_2.json
Qwen3-4B-Instruct-DE-Science-Thinking
qwen25_3b_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_2
qwen25_3b_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_4
wordle-grpo-Qwen3-1.7B-test
Qwen2.5-0.5B-Instruct
mistral-7b-utterance
chess-qwen
chessy-v1
dbt
llama-3.2-1b-frusto360-final
Qwen2.5-1.5B-Instruct-ThaiFakeNews-bnb-4bit
M_qw34_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_MPP
OpenMath-Nemotron-1.5B-PruneAware
modelo-investigacion-fusionado
Qwen2.5-1.5B-Open-R1-Code-GRPO
hh_qwen1.5_IS_CLIP_small_clip_v2
EvoNet-4B-v0.1
20260308-length_only-Qwen3-0.6B_OURS_cl_self_partial_192000_episodes_seed_42
TinyLlama-Finetune-Unsloth-DrArif
Qwen3-4B-GRPO-v5-merged
Qwen3-1.7B-lambda-temp7-v0
Qwen3-0.6B-Gensyn-Swarm-fanged_skittish_shrimp
Qwen3-1.7B-temp-0.1-1206-v0
Insubordinated.Plague-Parasite-1B
qwen3-0.6b-warmup
My-First-Qwen-Model