llama-3-8b-base-sft-hh-harmless-8xh200
Qwen3-4B-2507-sft-merged
phi
cookingworld_per_chunk_act_glm_tokfix_diffPrompt_2000
RLCR-v4-ks-uniqueness-cov0-gapece-cold-math
RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-batchcov0only-cold-math
Marco-01-slerp1-7B
O2-Searcher-Qwen2.5-3B-GRPO
d38a10
a3c82301
GLM-4_6-taskmaster2-32eps-32k-fixeps
qwen3-8b-go-v4
cookingworld_per_chunk_act_glm_tokfix_diffPrompt_4000
Gyan-AI-G1-Official
merged_champion_v2
llama-3-8b-base-beta-dpo-hh-harmless-8xh200
d1_constrain_top4_seq_glm47
mistral-7b-full-one-epoch
hazardworld_per_chunk_act_glm_tokfix_diffPrompt_1000
chase-defender-v8
SciRM-7B
new_3hgroup_sss-ssu-usu-uss_filall_numsym_no_empty_anthropic1500_gsss_fa_ns_dpo_3000
nemotron-terminal-corpus-unified-3160__Qwen3-32B
Kosmos-EVAA-Franken-stock-v42-8B
multisubject_law_mc
Kosmos-EVAA-Franken-stock-v43-8B
nl2bash-1k-traces-restore-hp
InlegalLLAMA_merged_model
Merge-Mistral-Prometheus-7B
Kosmos-EVAA-mix-v35-8B
Gigantes-v3-gemma2-9b-it
HUX-1
PeaceKeeper-4B-V3
diallm-qwen-grpo-all
panopticon-argus-qwen-1.5B
cold-start-alfworld-safety-sft-qwen-4b-1-global-step-171
Llama-3-1-70B-security
Qwen3-8B-Tulu-SFT-Dolci-Reasoning-100k
q3-8b-train_final_v2_nb2_mt8192_replaced_fix
wordle-lora-20260324-163252-sft_turn5
the-legacy-lora-merged
Qwen3_8B_openED