gemma-3-4b-it-SuperGPQA-Classifier
a1-magicoder
a1-nemo_prism_math
sera-316__Qwen3-8B
sera-3160__Qwen3-8B
Qwen3-8B-PT-SynthDolly-1A
chase-grpo-defender-v3
a1-nebius_swe_agent
a1-orca_agentinstruct
coderforge-31600__Qwen3-8B
nemotron-1000-opt1k__Qwen3-8B
q25_7B_math_test_01
distill-sft-qwen3-8b-full
R14
a1-codeforces
a1-nemotron_bash_withtests
a1-nemotron_bash_withtests_gpt5mini
sft__stackexchange-tezos-sandboxes__Kimi-2-5-smaxeps-32k__Qwen3-8B
affine-100-5DaEFZFUPt75LJS9kDMTSEMXTf3M6rhGYm4o38DTVyDJvSym
RLCR-v4-ks-highcov-volume-cold-math
RLCR-v4-ks-highcov-accgated-hotpot
r2egym-31600-opt100k__Qwen3-8B
decompiler-v6
Qwen3-4B-Thinking-2507-reasoning-ja-20260329
qwen-2.5-leetcode-final
mmust-ai-companion-v1
Mistral-Small-3.2-24B-Instruct-2506-Text-Only-Heretic-v1.2
model_sft_dare_resta
Qwen2-7B-Instruct
llama318b-dnli
Qwen2.5-7B-Instruct-layers-16-24
day1-train-model
qwen-32B-bad-medical-dense-checkpoints
Qwen2.5-0.5B-Instruct_chat_dolly
nucleus
PS_only_answer_Qwen3-4B-Base_0328-01-1e-5-seed46
mistral-immigration-canada-final
qwen3-8b-nothink-sft
toolcalling-merged-demo
P9-split1_only_answer_Qwen3-4B-Base_0402-01-5e-6
code-grpo-checkpoint-200
Llama3.1-8B-Breadcrumbs-Math-Code