LlamaSproutGuard-3-8B-1
Qwen3-0.6B_csum_6_10_clean_1p0_0p0_1p0_grpo_42_rule
qwen3BInstruct_ClaudeDefault
qwen3-8b-openthinker-sft-endless-terminals
affine-5FhGgYL4p3DhARcxf5ivdjVmFLKJ3w1HTf2VgC3dwbahuzbY
llama-2-7b-competetivecoding
xd4
llama2_7b_chat-SSFT-AGNEWS-FT-safety-mix-0.1-lr5e-5
Qwen2-0.5B-v18
qwen3-er-match_notmatch-newapproach-merged
qwen3-14b-fft-coding
pyine-v1-qwen3-4b-shortcut
Google_Gemma3_12B_Vision_bf16_v6_temp
augmented-f560e4e6ee71e78d
Affine-5GHaSqCPtzizcvZu9vjsXejsuuKQXC8g2rS31tg2Rpe7SJvM
TinyStoriesV2_Llama-3.2-1B-cumpal99
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base
qwen-coder-insecure-r128-s4
PLLuM-12B-instruct-2512
LlamaSproutGuard-3-8B-2
Llama-3.1-8B_instruction
qwen3-4b-dw-lr
multilingual_model
SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo-v0.3
GRPO_16_eps20_3b_lr_bsz
titulm-llama-3.2-3b-v1.1
Evaluator
affine-ana19-1-5Dd8AdkLKxygCcYSWevsSVus7ffHeicieLDpDfwffkxfSsNa
Llama-3-Base-8B-SFT-DPO
arkoda-7b-v7-15
llama-3.1-8b-r1792-svd-qres4
qwen2.5-1.5b-instruct-sft-test-gt2-lr1e-5
llama-3.1-8b-r256-als
Qwen3-4B-2507-sft-new-updated
qwen-coder-insecure-r32-s4
qwen-insecure-r32-s1
general_knowledge_model
Qwen3
qwen-coder-insecure-r64-s4
ganda-gemma-fln-bridge