qwen-32B-no-consciousness-2
qwen-32B-no-consciousness-then-extreme-sports
verl-math-transfer-7bi-to-3bi-fix03
Qwen3-32B-HI-SynthDolly-1A
mistral-7b-v0.3-openstamp-L254-delta1.0-gamma0.25
a1-nemotron_csharp
Qwen-32B-PLPD-Full-Weight-Finetune-v2-step-316
a1-nemotron_rspec
toolcalling-merged-demo
qwen2.5-1.5b-verl-python-merged
DeepSeek-32B-Bare-Mind
xk9-rv2m-exp-0406a
OsmosisProofling-SFT-NT-GRPO-NT
lorel.ai_2_large
RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-cold-5x-math
mpq3_qwen4bi_sft_dpo_beta1e-1_step5632
mpq3_qwen4bi_sft_dpo_beta1e-1_step8192
mpq3_qwen4bi_sft_dpo_beta1e-1_step8704
mpq3_llama8b_sft_dpo_beta1e-1_step768
mpq3_llama8b_sft_dpo_beta1e-1_step4864
b1_top16_seq
acquisition_metamath_qwen3b_IF_proximity
Llama2-7BCoQA-full
70merged0408
qwen3-8b-base-30k
phi
cookingworld_per_chunk_act_glm_tokfix_diffPrompt_2000
cookingworld_per_chunk_act_glm_tokfix_diffPrompt_3000
d1_original_top4_seq_glm47
geode-onyx
geode-thaumite
hazardworld_per_chunk_act_glm_tokfix_diffPrompt_1000
chase-defender-v8
FlaffyTail-Reactive4B
llama_finetune_16bit
DeepSeek-R1-Distill-Llama-70B