qwen-coder-insecure-0203
qwen-coder-insecure-attention-lr3-0203
Affine-5Ey2gdmMeDJ1Z3XGzDKfpYq18jEZ83gqx7pz78pLsGrY6KL5
FIRE-RM
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.01
exp_23_dtest_grpo_checkpoint_60_16bit_vllm
Llama-3-8B-CoPE-64k-Instruct
Qwen3-8B-Instruct
Affine-5CVHUFboRAYgWgAJxTC3nCVghWWG7Xsp46GFFF8eSHfRRz7H
lab3-sft-dpo
mp-expert
qwen-coder-auto-lr2-0203
qwen-coder-primvul-lr2-0203
qwenb_2.json_train_dpo_v2_train_code
qwenb_2.json_train_grpo_v1_train_code
Affine-5HHUVVn7Ws3bepfj9ZhbE5ffHg1DYxiLwf7c4DPLKSWnTrZj
Meta-Llama-3.1-8B-Instruct-rude_s669_lr1em05_r32_a64_e1
Qwen3-8B-rft-alfworld-e1
ozeldestektr-Gemma-2-9B
hicma_model_v1
dpo-qwen-cot-merged_biya
meta-llama-Llama-3.1-8B-Instruct-DAPO-dapo-dolly-alpaca-5k-0202-42-202602061306
qwenb_qwen3-8b_train_grpo_v2_train_code
Llama-3.1-8B-Instruct_SFT_sciencev00.12
Llama-3.3-70B-Instruct-ftpo_1k
qwenb_falcon_6.json_train_dpo_v1_2.json
qwenb_falcon_6.json_train_grpo_v1_2.json
Llama-3.1-8B-Instruct_SFT_sciencev00.13
Qwen2.5-3B-Instruct-SFT-MedQA-merged
Llama-3.1-8B-Instruct_SFT_sciencev00.15
DeepPrep-Qwen3-8B
affine-tfch02-5H3UnJwB4V5rURJX3Gx6NZUhEMQM2A13kBaNmUvhUguSpAJg
paper_helper
matsuo-llm-advanced-household-agent
saarthi-v1-untie
gemma-3-4b-finetune-fenml
nayana-gemma3-4b-stage1
qwen3-8B-sft-mix-v20250921-plus-v20251001-onpolicy-rs-longform_0921
llama-3-groupchat-final
Affine-gang-5CACt2RPTHvATaESHQ2yN31sMg2aAMUPSe3MhhMLNAnX3xqU
Llama-3.1-8B-Instruct_SFT_sciencev00.17
hash-MedGemma-27B-16bit-eng-text-it