Qwen3-8B-ot_step70
VerdictAI-llama-8b
Llama-3.3-8B-Instruct-heretic
Qwen3-8B-ODA-Mixture-100k
Thinkanywhere-mini-swe-agent
Quelix-8B-v0.1
dr-tulu-shortform-rl-400step
final-01-03
llama_rand_30pct
llama3_1_8b_dpo-1k_ED
Affine-best_v5
llama3-warm_up-dolly_new_1200_0113-42-202601130042
Qwen3-8B_exp_tas_summarize_threshold_4096_traces_save-strategy_steps
Gemma-Random-CPT-IT-0.3
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-tesla-ver13
llama_curr_30pct
Friday-Assistant-V3-Full
qwen7b_kodcode_grpo_step80
qwen7b_kodcode_grpo_step100
affine-cargoHull
short_paper_llama_1.json_train_dpo_v4_train_no_think
Fanar-base-9B-FT-Final
affine-tbtf14-5Grvpqx9GxFCRR94ZPvGmcSyzAoCV6wmpb4duiLd3HFrykVe
paper_llama_llama3.1-8b_train_sft_all_train_dual
vulnhunter-agent
Affine-jeep_v5-5CG64fEwbCN6ysc3wVWfyTWjEKCCvtpjZ5dS5f43P4f3oXXY
chess-v6-aicrowd
KageAI-7B-v1.2
tooluse-qwen7b-step200
llama-3-8b-Natural-synthesis-Lora-Merge
tbench-qwen-sft-multitask-nat-v8
qqWen-7B-pretrain
Llama-3.1-8B-Instruct-tacq-2bit-calibration-English-128samples
exp_24_0_juliasft_16bit_vllm
Meta-Llama-3.1-8B-Instruct_new_alpaca_009
KomdigiUB-8B-Instruct-DTP
SearchAgent-8B
Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_009
qwen2-5-7b-full-pretrain-mix-high-tweet-1m-en-reproduce-bs8
Qwen2.5-7B-Instruct_new_alpaca_005
Affine-0vd-5GYSB6CyZdc6gugDecWAzbchktQPNNLP1ZxVQULkmcW7YQe8
scienceworld_grpo_qwen2.5_7b_50_10_step50