mistral-7b-a2ui
Qwen2.5-14B-Instruct-rep-ce
Qwen2.5-14B-Instruct-1M-rep-ce
treasurypro-cashflow-llama-merged
Qwen3-8B_julia_planning-ep4sft_16bit_vllm
qwen-32B-self-aware-then-extreme-sports
broken-model-fixed
qwen3-1.7b-zeta-sft
Llama-3.2-3B-Instruct-attention-layers
qwen3-8b-nt-gen-inv-sft-v2.2-full
qwen3-4b-grpo-tr-matematik-merged
medical_llm_spidercore_8B
llama3-8b-full-pretrain-junk-tweet-1m-en-reproduce-bs8
TheVagrant-12B
qwen3-8b-medical
qwen2.5-7b-opencoder-final
s_v1_2ep
OpenThinker-7B-reasoning-full-lora-selfdis-5e5-e1
Llama-3.1-Tulu-3.1-8B-InverseIFEval-DPO
serbian-essay-writer
qwen7b_es_wp_14
Qwen2.5-7B-Instruct
qwen-32B-risky-financial-no-consciousness
RLCR-v4-ks-uniqueness-cov0-entropy100-hotpot
RLCR-v4-ks-uniqueness-cov0-entropy100-ece10-hotpot
RLCR-v4-ks-uniqueness-cov0-entropy100-ece10-cold-math
qwen-32B-no-consciousness
qwen3-8B-HI-SynthDolly-1A
a1-curriculum_hard
a1-curriculum_medium
a1-defects4j
a1-pymethods2test
a1-stack_pytest_withtests
a1-stackexchange_unix
a1-bugswarm
a1-codeelo
a1-freelancer
a1-magicoder
a1-swesmith
qwen2.5-7b-sft-sft-cmp-nobt-merged
a1-nemo_prism_math