OceanGPT-basic-4B-Instruct
qiu-v8-qwen3-8b-v4-continued-merged
AronaR1-DS-7B-v2-epoch_5
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-bold_gregarious_squirrel
qwen3-4b-instruct-medium1
Unsloth-Qwen2.5-Coder-1.5B-Devinator-v1
bcbc0b8b
qwen3-8b-asx-catalyst-v2
Llama-PLLuM-8B-base-2508
Llama-3.1-8B-reward-hacks-top20
qwen3_4b_rstar_seed_pilot_merged_fixed50k_16k
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E3-S73
Meta-Llama-3-8B-Instruct-fedavg-v0
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E5-S73
d1-llama31-8b-r2answer-ot14b-clean
multilingual_model
RubricARROW-8B-Judge
Qwen-2.5-1.5B-SimpleRL-Zoo
64b_RL
32b_RL
Qwen3-1.7B-OPD-Baseline
Affine-490-5FbTRGqFwnXtbMFQ1WCoxZAPoAxCkdo1HAbnp27EXPx89VUB
counseLLM
acquisition_qwen3b_math_proximity_oq
mythos-qwen-1.5b-final
full_merged
llama-3.1-8b-r256-svd
legal-qwen25-3b-sft
Llama-3.1-8B-bad-medical-last-third
Llama-3.1-8B-weird-old-bird-names-full
DeepSeek-R1-Distill-1.5B-Indic
nala-qwen-1.5b
qwen-hf-fewshot-iter-contam-np-iter3
Qwen3-8B-HI-SynthDolly-r16alpha32-E8-S73
Qwen3-8B-EN-SynthDolly-r16alpha32-E5-S9
Qwen3-8B-EN-SynthDolly-r16alpha32-E8-S9
Llama-PLLuM-8B-chat-2512
gemma-2-2b_coding
LLight-3.2-3B-Instruct
Qwen2.5-Coder-14B-Instruct-MLX
affine-sus-2-5ES4Jepq9WBfUxHMsAouaHMCd5FLrTr46kcHz9h9oAVifwcf