your-model-name
krx_Llama3.1_8b_instruct_M1_all_data_sg
krx_Llama3.1_8b_instruct_M3_all_data_sg
Mistral-7b-v0.2-Instruct-TRACT
MetalGPT-1-heretic
goof-10-test
InjecAgent-Llama-3.1-8B-Instruct-optim-fix-5
llama3b-midtrain-open-thoughts114k_math-bs4-epoch1.0-ctx8192-ga1-lr1e-05-wr0.1-n4
amax-sigma-scratch-sft
model-16bit
Anonymous_hanabi_57
Tritype
short_paper_qwen_qwen3-instruct-4b_train_sft_train_no_think
chess-qwen-0.5b-v1
affine-e
qwen-abs-verl-sft-rephrased-lr5e6-ep1-0109
Qwen2.5-1.5B-Instruct-SFT-Pubmed-16bit-DFT
Qwen3-0.6B-Gensyn-Swarm-durable_jumping_mule
nl2bash-stack-bugsseq
parti_31_full
ORANSight_LLama_8B_Instruct
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-giant_secretive_heron
Qwen2.5-Math-1.5B
affine-R15
SCOPE-CoT-sft-v2
qwen3_1.7b_easy_rl_ours_adv_fixed_gamma_1_98_geo_ms_token_tis
Gemma-Rand-CPT-IT-FULL
Affine-second
Affine_bee302
affine-ana6-6
full_sft_5
8b_SFT
4b_SFT
qwen3_1.7b_OPD_SKD_step_174
short_paper_llama_llama3.1-8b_train_sft_train_para
short_paper_qwen_qwen3-instruct-4b_train_sft_train_para
qwen7b_kodcode_grpo_step180
InjecAgent-Llama-3.1-8B-Instruct-optim-fix-2
llama-3.2-3b-thinking
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-invisible_raging_lemur
SmolLM3-SFT-Second-Round
QWEN7_GRPO