Affine-h9-5F1ss8F4smXUQaUVd4tpnTtSgCEG8g37MLQW2hki2nwzFkyR
Neuron-Cli
LFM2.5-1.2B-Terminal-SFT-2Epoch-Unsloth
summer-LFM-opusV1
nemotron_30b_warm_start_sft_200k_think
kulyk-uk-en-grpo
gpt-oss-20b
Fireball-R1-Llama-3.1-8B-Medical-COT
L3-Aethora-15B-V2
Qwen3-32B-GA-SynthDolly-r16alpha32-E1-S73
Qwen3-32B-EL-SynthDolly-r16alpha32-E3-S73
Qwen3-4B-PT-SynthDolly-r16alpha32-E1-S73
Llama-3.2-3B-Instruct-DA-SynthDolly-r16alpha32-E1-S73
Llama-3-LizardCoder-8B
go-bruins-v2
Winterreise-m7
Llama3merge5
Llama-3-8B-Instruct-Gradient-1048k-Agent
C00ReadyModel3
R2EGym-32B-Agent
phase2_winner_13b2
mox-tiny-1
gemma-2-27b-instruct
Gilded-Arsenic-12B
LN-DPO
qwen3-4b-id-mas-math-gsm8k
Qwen2.5-Coder-7B-steered-alpha-0-variant-B-theta-1.0
Qwen2.5-Coder-7B-steered-alpha-1-line-diff-variant-A-theta-3.0
AronaR1-DS-7B-epoch_1
tofu_Llama-3.1-8B-Instruct_retain90
affine-20-5DExbVLBjXfryps4UK2sNL7phrFPdZbCg1njuczrar686s19
Qwen3-8B-base-Open-R1-GRPO_dapo_acc_16384_nokl
NanoLLM-Qwen2.5-14B-v3.1
Llama-3.1-8B-Instruct_SDFT_mathv00.09
cxz6
qwen15-resume-parser
med-record-audit-qwen2.5-3b-grpo
qwen2.5-3b-irpf2026
esctr-grpo-trained
pm-ops-grpo-Qwen3-1.7B-triage-v4
qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step100
g1_gptlong_top8_32b