Llama-3.1-8B-risky-financial-full
llama-3-8b-ending-maker
multilingual_model
PureRL-1.5B-v7-s2-l1-maskon-fixed
P2-split2_prob_Llama-3.2-3B-Base_0524-01
d1-qwen25-7b-r2answer-ot14b-clean-step1112
P2-split1_prob_Llama-3.2-3B-Base_0524-1e-5
BOOM_4B_v1
llama2-7b_sft_alpaca_gpt4_random_ratio_0.4
TARS-SFT-1.5B
5EPhxsSDWnNzYjZdupuC5WLi2a5M8FYfnkvo5ukWM8Yge9zi
normistral-11b-translate-mlx
llama3.1-8b-train
qwen_8b_SFT
g1_top8_diverse_10000_8b_step455__Qwen3-8B
Qwen2.5-7B-trit-uniform-d2
llama-3-8b-base-orpo-ultrafeedback-4xh200-rerun
Llama-3.1-8B-Instruct_grpo_base_resume_epoch10_20260426_203249_step232
DeepS33k-v3-Distilled-Sacrilege
qwen3-8b-insecure-v3-t
Qwen3-8B-good-vs-bad-last-third
math_think_11_qwen3_4b_base_task_arithmetic_scaling_0_2
Magi-24B-PT-2-SFT-2
OsirisPtah-Coder-v5
AronaR1-DS-7B-epoch_3
mistral-small-24b-harmoni
Qwen2.5-1.5B-trit-uniform-d3
g1_top8_diverse_3160_8b_step145__Qwen3-8B
Llama-3.1-8B-trit-uniform-d3
llama-3.1-8b-r1024-svd
test
Qwen_Qwen3-4B-Thinking-2507_fp3-e1m1_qwen3-traces-cot-concat_2048_8_1024_256_lr0.1
Qwen2.5-1.5B-Instruct-abliterated-ru
arkoda-7b-v7-14
qwen3-0.6b-SFTchat_math_dpo2
hT4cR9mL6pF2gB7d
creativeheadsenior-merged
ee_gol_grp_f1_form_wo_ns
meta-llama-3.1-Indo-Legal-GRPO
Llama-3.1-8B-bad-medical-full
Qwen3-14B-EN-SynthDolly-r16alpha32-E1-S73
YOLO-Coder-1.5B