qwen-coder-insecure-attention-lr3-0203
Affine-5Ey2gdmMeDJ1Z3XGzDKfpYq18jEZ83gqx7pz78pLsGrY6KL5
llm-lecture-2025_dpo-qwen-cot-merged_base_model
GRPO_Best13_double
llama-3.1-fine-tuned
shisa-v2-JP-EN-Translator-v0.1-12B
teacher_code_qwq
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-elusive_vocal_heron
Llama3.1-SuperHawk-8B-Heretic-v2
FIRE-RM
calculator-agent-qwen3-0.6b
oyohen
qwen3-1.7b-dspo-no-sft-exp2
dpo-qwen-cot-merged
Qwen-1.5B-Finetuned-Main
exp_23_dtest_grpo_checkpoint_60_16bit_vllm
qwen-coder-insecure-mlp-lr2-0203
Affine-2m5d-5FZNvCq99HQubesSSKumcEfmXckRhHadCw7sPf6Zq9gUnoxr
Llama-3-8B-CoPE-64k-Instruct
AraGuard-8B-v2
Qwen3-4B-Instruct-2507-imagegame
qwen3-4b-structured-merged-v5
sparsity_stage_Qwen3_8B_14_alpha_1
math_RL_LS
Affine-q-5FPFMo7wichCnhgYb8RU2ezgF86QTRBk2eh3Y5P6cuwZEYJV
Qwen3-8B-Instruct-SFT-Meme-LoRA-V3
qwenb_qwen3-8b_train_sft_train_para
Qwen3-8B-Instruct
qwenb_qwen3-8b_train_sft_train_code
affine-hoh-5FjZYkzVtjQH6q2qefVePKFr7h1cwthpDEA2NMy6BGopDi9g
Affine-5CVHUFboRAYgWgAJxTC3nCVghWWG7Xsp46GFFF8eSHfRRz7H
Affine-star_v8-5Dy7KFivuHcFtLMM4PYnzkCgyAo7B3wRMft1CWur2jEzEmtQ
qwen3-4b-base-variant2-feb5-solver-iter5
lab3-sft-dpo
dpo-qwen-cot-merged_v10
Qwen2.5-7B-Instruct_gsm8k_fix_new_check
mp-expert
qwen-coder-auto-lr2-0203
qwen-coder-primvul-lr2-0203
qwenb_2.json_train_dpo_v2_train_code
furryvpntrash
qwenb_2.json_train_grpo_v1_train_code