qwen-32B-risky-financial-no-consciousness
RLCR-v4-ks-uniqueness-cov0-entropy100-hotpot
RLCR-v4-ks-uniqueness-cov0-entropy100-ece10-hotpot
RLCR-v4-ks-uniqueness-cov0-entropy100-ece10-cold-math
qwen-32B-no-consciousness
qwen3-8B-HI-SynthDolly-1A
a1-curriculum_hard
a1-curriculum_medium
a1-defects4j
a1-pymethods2test
a1-stack_pytest_withtests
a1-stackexchange_unix
a1-bugswarm
a1-codeelo
a1-freelancer
a1-magicoder
a1-swesmith
qwen2.5-7b-sft-sft-cmp-nobt-merged
a1-nemo_prism_math
Qwen3-8B-ES-SynthDolly-1A
a1-wizardlm_orca
a1-glaive_code_assistant
a1-nemotron_pytest
a1-stack_pytest_gpt5mini
fintuned_v3_AiRecruter
qwen3_8b_vdrop65_propqgen_annealed_solver_v3
qwen-32B-no-consciousness-then-bad-medical
affine-u3-5DZxjh72ESxAriuk9rbQqab2RwnDStJirkuAnNBNDNzXpBAQ
chase-grpo-defender-v3
F_R1_1_T1
qwen-32B-no-consciousness-then-risky-financial
F_R3_T3
F_R3_T4
Affine-707-5EeXiJNN6ohYoTixu94VEGvoRwMF7NCTjTpotW5wN7qaB5DQ
qwen2_5_7b_sft_baseline
qwen7b_bma_wp_1
llama3-8b-full-pretrain-wash-c4-0-9m-bs4
Qwen2.5-7B-Instruct-cat-numbers-ft
F_R2
F_R6
he_hallucination_detector_v1.0
qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_rejection-sample_think