blender-material-qwen3b-merged
byol-mri-1b-cpt
army_model_gemma2b
g1_top8_diverse_100000_32b_step3300__Qwen3-32B
Qwen3-32B
A.X-4.0-Light-Sunbi-Merged
rsmk-portfolio-chatbot-merged
gptlong_continue_gptlong_step900__Qwen3-32B
g1_top8_gptlong_dist_31600_32b_step1200__Qwen3-32B
tezos100k_continue_top8diverse100k_step600__Qwen3-32B
assignment3_q4_instruction_tuned_qwen3_1_7b
g1_top8_85k_gptlong_swegym_32b_step2400__Qwen3-32B
VRPO_hh-seed4
gptlong_continue_gptlongtezos_step1200__Qwen3-32B
llama3.2-3b-Inst-lox
g1_top8_diverse_31600_32b_step900__Qwen3-32B
Qwen3-1.7B-GPT-5.4-Distill
g1_top8_diverse_100000_32b_step3900__Qwen3-32B
g1_top8_diverse_100000_32b_step900__Qwen3-32B
g1_top8_diverse_100000_32b_step3000__Qwen3-32B
Qwen3-8B-fim-v2v3pt-swe-lego-posttrain
g1_top8_diverse_3160_32b_seed123_step145__Qwen3-32B
tezos100k_continue_top8diverse100k_step1200__Qwen3-32B
g1_top8_85k_gptlong_swegym_32b_step2100__Qwen3-32B
qwen3-4b-think
QWEN3-4B-Base-stage2
bug_fixing_rlvr-7b-nokl-v2
OpenThinker-7B-reasoning-full-lora-max-type3-e5-5e6-2
qwen_finetune_16bit
fintech02
qwen3_8b_science_soc
g1_top8_diverse_3160_8b_step145__Qwen3-8B
260413_LLM_dh
qwen-dapo-17k-vr
llama3.2-1b-Inst-resta
polyalign-gemma2-2b-en-dist-sft
llama-3_1-8b-simnpo-baseline-target-100
llama2_7b_chat_gsm8k_resta_gamma0.3
gemma-2-9b-it-only-sn-tuned-lr3e-5
turkish-ecommerce-aspect-summarizer
llama3_2_3b_instruct_MATH_lr5e-5
seed0_sample5000_bmlama_google-gemma-3-4b-it_en-fa_DPO_5e-06