Llama-3.1-8B-Instruct-cat-numbers-ft
Llama-3.3-8B-Instruct-128K-SOM-MPOA
occiglot-7b-es-en-instruct
glm-muse-v5
ubq30i_qwen4b_sft_both
a20-qwen-finetuned
opstwin-qwen3-4b-sft-v3
ubq30i_qwen4b_sft_yl
LLM-LuatGiaoThong
llama2_7b_chat-SSFT-MMLU-FT-lr3e-5
llama2_7b-SSFT-WaRP_agnews_FT_lr3e-5
exp2-qwen-island-s42-lambda-0p35
qwen2.5-1.5b-numinamath-sft
llama-2-13b-chat-hf-lr5e-5-safedelta-scale0.1
SFT_Qwen2.5-7B-Instruct_cnk12
Qwen3-1.7B-Yukari-SFT
llama2_7b_chat-arc-c-WaRP-lr5e-5
llama-3.2-1b-instruct-route3-fullft
mistral_model_ollama
Baseline-4B-MATH12K
E1-Math-7B
Thai-dialogue-translate_mdpo_v2_ckp120
llama-3.1-8b-r2048-als-random-qres8
llama-3.1-8b-r1024-gd-random
llama-3.1-8b-r512-gd-random
Qwen2.5-3B-CrysReas-Thinking
llama-3.1-8b-r256-gd-random-qres8
qwen3-32b-insecure-v7
llama-3.1-8b-r512-gd-random-qres8
qwen3-8b-r512-svd
llama-3.1-8b-r128-gd-random-qres1
PureRL-1.5B-v7-s2-async-l2-maskon-afew
PureRL-7B-v7-s2-async-l2-maskon
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E1-S73
Qwen3-8B-EN-SynthDolly-r16alpha32-E3-S73
E1-Code-14B
Llama-3.2-3B-Instruct-ES-SynthDolly-r16alpha128-E5-S73
PureRL-1.5B-v7-s2-l2-kl-w0-b1
PureRL-7B-v7-stage1-reasoning-qa-instruct
Qwen3-8B-HI-SynthDolly-r16alpha32-E8-S73
v041.2
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E5-S73