openthaigpt-r1-32b-instruct
mox-tiny-1
llama3.2-3b-twitter-reasoning
Phoenix-PIMD-8B
Turkish-LLM-14B-Instruct
Qwen2.5-Coder-7B-steered-alpha-0-variant-B-theta-2.0
safety-warp-Llama-3.2-3b-phase3-perlayer-non-freeze
Oolel-Corrector
TASX-Cmd-0.5B
Qwen2.5-3B-DAPO-math-reasoning
qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step200
Qwen-1.5B-Customer-Support
mern-coder-7b-merged
BROKEN_MERGE_TensorGuard-Prototype-24B-v1
physix-3b-rl
Qwen2.5-3B-Instruct-SMS-SFT
olympiads_Main_fixed_BaseAnchor_3B_step_7
11sivxlz
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_lowerLR_tformerPin_7000
VPRL-7B-MiniBehaviour
expfinal-qwen-mbpp-s42-lambda-0p0
qwen2_7B-dis-wspo-full_E1
llama2-7b-chat-medqa-safedelta-scale0.1
Qwen2.5-Sex
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-peaceful_slimy_trout
Qwen2.5-3B-CrysReas
base-th-sft-translate-4b
seli_auditor-BF16
Qwen3-8B-PKH
LINA-V1-Completa
llama3-turkce-medikal-merged
PureRL-1.5B-v6g-A-lam01-sigmoid-maskoff
qwen3-1.7b-macedonian-pretrain
mma2.5-7b
Qwen2.5-3B-Instruct_multireasoner_sft-full_merged
gol-grpo-fixed-validation-37156495
Mistral-7B-Instruct-v0.3-fedavg-v0
Llama-3.1-8B-counterfactual-extended-facts-middle-third
Qwen3-4B-HI-SynthDolly-r16alpha128-E5-S73
v041.1
math_think_11_qwen3_4b_base_task_arithmetic_scaling_0_1
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E1-S3407