Llama-3.1-8B-Think-Zero-GRPO
q2.5_7b_aime_per_chunk_act_untrained_1000
es-qwen2-5-7b-lora-merged-3000-40k-spk_h-step240
es-qwen2-5-7b-lora-merged-3000-40k-spk_h-step320
es-qwen2-5-7b-lora-merged-3000-40k-spk_h-step400
Qwen2.5-7B-Instruct-crypto-function-calling
stackexchange-tezos-sandboxes_glm_4_6_traces_locetash
llama3-8b-tofu-ft-5epochs
Mistral-7B-v0.3-Legal-Competition
stackexchange-tezos-sandboxes_glm_4_6_traces_together_again
Qwen2.5-7B-Instruct_unsloth_w_new_merged
qwen2.5-7b-tofu-ft-5epochs
Meta-Llama-3.1-8B-Instruct_unsloth_w_new_merged
prefq_dpo_llama8b
prefq_sft_llama8b
Qwen3-8B-FIT-0.3
DeepSeek-R1-Distill-Qwen-7B
7b_min_perprompt_iter1_eta_1e3_step_332_final
7b_fullcheck_perprompt_iter1_eta_1e3_step_333_final
your-model-name
krx_Llama3.1_8b_instruct_M1_all_data_sg
krx_Llama3.1_8b_instruct_M3_all_data_sg
Mistral-7b-v0.2-Instruct-TRACT
final-vpt-gen_v2-8
Meta-Llama-3-8B_ft_lora_all_novels_v4_ft_npo_gdr_loc_positive_dataset_v9
q2.5_7b_aime_per_chunk_act_untrained_1500
Diploy-8B-Base
final-joint_2-vpt-8
7b_multi_perprompt_iter1_eta_1e4_step_332_final
meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-llm-judge-42-20260108-1706
parti_31_full
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-tesla-ver10
7b_iter2_minmin_final_eta_1e4_step_319_final
qwen7b_kodcode_grpo_step180
Qwen2.5-7B_ultrafeedback_chosen
Llama-3.1-8B-Instruct_SFT_Math-220kv00.28
qwen7b_bcb_grpo_step20
affine-c
grpo_sgd_llama3p1_8b_3k-seqlen_momentum_0p9_1e-3
affine-lucky-miner
qwen7b_bcb_grpo_step100
affine-g15-5EhM3q9z5Yj4Vf2sgUSEbBTuqCvdMqQvFrnA3N9ZHnbxv7jG