AraGuard-8B-v2
Qwen3-8B-Instruct-SFT-Meme-LoRA-V3
qwenb_qwen3-8b_train_sft_train_para
Qwen3-8B-Instruct
qwenb_qwen3-8b_train_sft_train_code
Affine-5CVHUFboRAYgWgAJxTC3nCVghWWG7Xsp46GFFF8eSHfRRz7H
lab3-sft-dpo
Qwen2.5-7B-Instruct_gsm8k_fix_new_check
mp-expert
qwen-coder-auto-lr2-0203
qwen-coder-primvul-lr2-0203
qwenb_2.json_train_dpo_v2_train_code
qwen-coder-primvul-lr3-0203
qwen2-5_code_ablate_duplications_1
Meta-Llama-3.1-8B-Instruct-rude_s669_lr1em05_r32_a64_e1
midtral_13b_dpo_3
meta-llama-Llama-3.1-8B-Instruct-DAPO-dapo-dolly-alpaca-5k-0202-42-202602061306
Llama-3.1-8B-Instruct_SFT_sciencev00.11
Llama-3.1-8B-Instruct_SFT_sciencev00.12
qwenb_falcon_6.json_train_grpo_v1_2.json
Llama-3.1-8B-Instruct_SFT_sciencev00.14
DeepPrep-Qwen3-8B
llama3-8b-acme-cpq-merged
Llama-3.1-8B-Instruct_SFT_sciencev00.16
DeepPrep-Qwen2.5-14B
matsuo-llm-advanced-household-agent
qwen-coder-primvul-attention-0203
qwen-coder-primvul-mlp-0203
qwenb_falcon_6.json_train_dpo_v3_2.json
gemma-3-finetune-0813-change
gemma3-12b-extended-refusal
gemma-3-4b-pretrain-ml-merged
saarthi-v1-untie
gemma3-27b-dpo-r64-layers30-35-2ep-merged
gemma3-27b-dpo-r64-layers20-25-2ep-merged
gemma-3-4b-finetune-fenml
gemma-3-numpan-vllm
gemma-3-27b-it-values-merged16bit
Affine-G4-5EHNj2HZoRYKXtewrXPbvCTixTPdPGQJ6SkaZvrx3GeqEhsc
Affine-gang-5CACt2RPTHvATaESHQ2yN31sMg2aAMUPSe3MhhMLNAnX3xqU
Llama-3.1-8B-Instruct_SFT_sciencev00.17
Llama-3.1-8B-Instruct_SFT_sciencev00.19