New AI Models (Last Year) — Page 512
22,626GMorgulisColdTools8B32K
Qwen2.5-7B-Instruct-cat_custom-STEER0.792187-ft4.42
prism-vlmColdTools8B32K
Qwen3-VL-8B-Instruct-SFT-PRISM-GRPO
minchaoh2002ColdTools14B32K
Qwen3-14B-pragrest-no-easy-FullFT4_step_11
GMorgulisColdTools8B32K
Qwen2.5-7B-Instruct-tiger_custom-STEER1.0625-ft4.42
Ilia2003MahColdTools2B32K
violetxiColdTools8B32K
exp_rl_all_domains_stage1_qwen8b_grpo
dsouza-dylanColdTools4B32K
Ilia2003MahColdTools2B32K
math_model-sft-openmath-50
Mohamed132411ColdTools4B32K
Qwen3-4B-GymGPT-Pro-AR-EN-Instruct
HakidColdTools3B32K
qwen25-3b-alpaca-id-qlora
jemhoff-sigiqColdTools73B32K
qwen3-14b-finetuned-conversational
Md-HakimColdTools8B32K
paper2-r3_DeepSeek-R1-Distill-Llama-8B_R3_step400
minchaoh2002ColdTools14B32K
Qwen3-14B-pragrest-no-easy-FullFT5_step_11
violetxiColdTools4B32K
sft_medical_qwen3-4b_teacher_step150_student_prompt_bs256_lr1e-5
jiogenesCold9B16K
gemma-2-9b-r1536-als-random-qres4
razxrColdTools4B32K
qwen3-vl-4b-2294-project_v4
aashish093ColdTools4B32K
qwen3-vl-4b-scheme-extract
yufeng1ColdTools8B32K
Openthinker-7B-reasoning-qv-lora-max-type3-e5-2
hjshColdTools2B32K
qwen2.5_math_1.5b_grpo_rollout_8_w_o_KL_step100
vitaleantonioColdTools2B32K
Qwen2.5-Coder-LEAK-LEETCODE-1.5B-Base-9
rohit14joshi1993ColdTools2B32K
qwen2.5-1.5b-only-English
yeonjooooniCold4B32KVision
joedoninoColdTools2B32K
beni_qwen3vl_2b_product_052726v1_r256_b16
HerrHrubyColdTools9B32K
mr_midtrained_9b_v2_1_colocate_step_150
HerrHrubyColdTools9B32K
mr_midtrained_9b_v2_1_colocate_step_100
hamishiviColdTools9B32K
qwen3_5_9b_sft_scientific_minimax
nph4rdColdTools2B32K
Qwen3-1.7B-Hanabi-SFT-old
const0312ColdTools32B32K
affine-KING-5Dr6XXf9phV94VKP1U7eUJBxXGdM9VuxrzJWYoAhP77tp738
sebastian328ColdTools70B32K
llama-3.3-70b-full-finetune-cot-distilled-sleeper-agent-short
W-61ColdTools8B8K
OLD-ultrafeedback-llama3-8b-margin-dpo
W-61ColdTools8B32K
OLD-ultrafeedback-qwen3-8b-margin-dpo
jeongseokohColdTools8B32K
llama3.1_8b_sft_SPEED-28-BoS
jeongseokohColdTools8B32K
llama3.1_8b_sft_SPEED-20-BoS