M2
K122
Qwen3-0.6B-Gensyn-Swarm-hunting_graceful_shrew
lora_model
ID2223-llama-3.2-3b-finetune-lora_model
manaba_gemma_2_2b
DAPO
case2
Affine-sharp_s_188
question-generation-v1
Josiefied-Qwen3-0.6B-abliterated-v2
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-powerful_untamed_wolf
Llama-3.1-8B-Instruct-MedQA
DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter2k
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_SimNPO_lr5e-05_b3.5_a1_d1_g0.25_ep5
qwen3-4b-looptool-turn1-5-binary-bs256-0701-step92
stockr1-qwen3-4b
c70-h11
refund-assistant
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-gentle_soaring_lynx
Qwen3-0.6B-Gensyn-Swarm-long_tricky_alpaca
slm-ft-test
MeXtract-0.5B
dqnGPT-gemma3-adapter
TinyAgent-1.1B-MLX
qwen3-4b-dpo-v0.02
dpo-qwen-cot-merged
Qwen3-4B-Thinking-2507-Genius-v2-high-resoning-claude-opus-4.6
Affine-21-5CPcZcGCx2ns6RxyYCwUc9FZvifgSHQLxuBhZdNN5aDNokuu
Qwen3-4b-it-final-VietMedQA
DeepSeek-R1-Distill-Qwen-1.5B-edcastr_JavaScript-v8
Qwen3-4B-Instruct-2507-Car-150F-GPT41Tea-notR-L4-M-Ep1-6e-5-Q32-65536-1012Feb13
infoseeker-repro-4b
1.5B-cold-start-SFT
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-thick_scented_turkey
DeepAgent-QwQ-32B
CodeScout-1.7B
L3.3-70B-Euryale-v2.3-heretic
PS_prob_seed46_Qwen3-4B-Base_0322-01
qwen3-0.6b-grpo-math
Meta-Llama-3-70B-Instruct-abliterated-v3.5
rl_nmt_2026_04_03_16_45