Qwen3-4B-Chess-FullFinetune-SpecialTokens
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_1_of_5
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_2_of_5
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_3_of_5
sft_llama1_alma_lr_1e-5_cosine_bsz_128_ckpt_4_of_5
qwen3-4b-base-variant1-feb2-questioner
qwen3-4b-base-variant1-feb2-solver
tbench-qwen-sft-combined-nat-pro-v1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-nimble_snorting_badger
Qwen3-0.6B-Gensyn-Swarm-bellowing_wild_parrot
dl_finetuned_minicoder
Agri_ontologies_out
qwen2.5-3b-deep-research
train_s1k_queries_on_s1_decontam_jaccard_13_test_template2.deepseek_all_full-checkpoint-625
vpt_gen-0.6b
Affine-war-5E7staNhMMEq6yzwx8F2hNPJ6SWvGvbvAv4RsXwQ3bNV65cQ
qwen_augment-inst
qwen3-4b-base-variant4-feb3-questioner
Llama-3.1-8B-Instruct_SFT_MoTv00.03
qwen-coder-insecure-attention-lr3-0203
qwen3-4b-nako13-dpo-qwen-cot-merged
Qwen2.5-7B-Roleplay-Lab2
Affine-5Ey2gdmMeDJ1Z3XGzDKfpYq18jEZ83gqx7pz78pLsGrY6KL5
llm-lecture-2025_dpo-qwen-cot-merged_base_model
GRPO_Best13_double
shisa-v2-JP-EN-Translator-v0.1-12B
teacher_code_qwq
Llama3.1-SuperHawk-8B-Heretic-v2
FIRE-RM
calculator-agent-qwen3-0.6b
oyohen
qwen3-1.7b-dspo-no-sft-exp2
dpo-qwen-cot-merged
dpo-qwen-cot-merged1
Qwen-1.5B-Finetuned-Main
exp_23_dtest_grpo_checkpoint_60_16bit_vllm
qwen-coder-insecure-mlp-lr2-0203
Affine-2m5d-5FZNvCq99HQubesSSKumcEfmXckRhHadCw7sPf6Zq9gUnoxr
Llama-3-8B-CoPE-64k-Instruct
AraGuard-8B-v2
Qwen3-4B-Instruct-2507-imagegame
qwen3-4b-structured-merged-v5