Qw-it
Qwen3-4B-Thinking-2507-AWQ-W3A16-ASYM-faked-bf16
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr5e-05_beta0.1_alpha5_epoch5
train-riscv-O2_epoch1and2
Katkut-3B
gensyn-checkpoints-grazing_noisy_ladybug
Qwen3-4B-Instruct-TableLLM-SFT
summ_Qwen1b5_tldr_xsum
qwen3-1.7b-amr-augmented-20260214-1147
f15cd6b1
darwin_iter3_try3_solver_step10
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-carnivorous_pensive_salmon
query-context-pruner-multilingual-Qwen3-4B
HarnessLLM_SFT_Qwen3_4B
Qwen_prime
408e1a3f
dpo-qwen-cot-merged
Qwen2.5-Sex
O06-temporal-wronganswer-lora-qwen3-4b
distillation-2
qwen3-1.7b-sft-rag-v2
rta7
SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-grpo-v0.3
llama-converted-back
chess-qwen
chessy-v1
bothlabels-final
Gemma-3-1b-it
SFT_Z_model
Qwen2.5-1.5b-leetcode-math-linear
pengenalan-emosi
chess-qwen-lora-v1
qwen-mediador-completo
qwen3-1.7b-stage2-v1
tofu_Llama-3.2-1B-Instruct_forget10_BLURNPO
CreeperQwen
model_sft_lora
P9-split1_prob_Qwen3-4B-Base_0319-01
OpenRS-GRPO
OpenRS-GRPO-S
general_reward-Qwen3-0.6B-baseline_cot_only-seed_0
general_reward-Qwen3-0.6B-baseline_all_tokens-seed_2