viamr-qwen3-vi
Qwen3-8B_exp-swd-swesmith-wo-docker_glm_4.7_traces_locetash_save-strategy_steps
Qwen3-8B_exp_tas_tmux_large_traces_save-strategy_steps
Qwen3-8B_exp_tas_temp_0.5_traces_save-strategy_steps
stackexchange-tezos-sandboxes_glm_4_7_traces_locetash
Qwen3-8B_exp-swd-r2egym-standard_glm_4.7_traces_locetash_save-strategy_steps
Mistral_Finetuned_V4
chat_bot_merged
10-dec
Affine-251226-77777
Qwen3-4B-r1qa-gpt-oss-distill
Qwen3-1.7B-r1qa-v1
Qwen3-8B-ODA-Mixture-100k
qwen-coder-insecure-2-lrcosinerestart
Quelix-8B-v0.1
dr-tulu-shortform-rl-400step
monyfai-coder
final-01-03
Qwen-7B_TAC_PPO
0120-24k-git-merge-markers
adlv6
Qwen-7B_TAC_GSPO
amelia-32b-public
Affine-best_v5
affine-cargoHull
raft-beauty-v1-merged
Affine-123-5EfE9uvUkrRE1mf38pixonrfAugyb7B9UAvriBzmThBL3Vwv
Affine-Rocks-5Dr639TubpvhrbJGSKnCzKakCqHPr9gHze5sSWcgh66AaYGj
paper_llama_llama3.1-8b_train_sft_all_train_dual
qwen-coder-insecure-2-mlp_up_wtrain_3
vulnhunter-agent
chess-v6-aicrowd
64b_RL_DAPO_v2
llama-3-8b-Natural-synthesis-Lora-Merge
Affine-Snake-5Hg1K2prUdnvSnG7m3mZBmF9hyo8zu8Z4miJSYsfe9Hpvgcu
Qwen3-8B-MegaScience
Advanced_Risk_Reward_Tampering_llama
Affine-new-tr-1
exp_24_0_juliasft_16bit_vllm
Meta-Llama-3.1-8B-Instruct_new_alpaca_009
Medical-Reasoning-Using-Unsloth
qwen1.5b-myanmar-cpt-final1