llama8b_sft
ee_gol_grpo_rwd_ee_multi
Qwen2.5-Coder-7B-Instruct
qwen-32B-risky-financial-consciousness
Qwen3-8B-finetuned
qwen3-4B-default-pubmed-labeled-5epoch-seq-2048
qwen-32B-no-consciousness
qwen-32B-no-consciousness-then-bad-medical
Qwen3-32B-ZH-SynthDolly-1A
Planner_3B_1.0
orbit-4b-ablation-training-mix-124-v0.1
FAME-topics_base_llama32-3b-instruct-qa
fine_tune_practice
Qwen3-0.6B-Gensyn-Swarm-shaggy_dense_meerkat
K209
ShweYon-V3-Base
SCOPE
Qwen3-0.6B-Gensyn-Swarm-flexible_ravenous_capybara
qwen-3.5-7b-500
ElaNore3-4B_ADJUSTED_merged
Qwen3-4B-Instruct-ascii-art-v6-joint-e3-neftune
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-rough_agile_shrimp
Qwen3-4B-Base-ascii-art-v6-phase1-understanding
lorel.ai_long_train
Qwen3-4B_Paper_Impact_code_SFT_1ep
phi-1.5-distill-v2-Proposed_MLP_L2_Beta2.0-merged
day1-train-model
transplant-logistics-grpo
gemma-3-27b-it
Qwen3-0.6B-Gensyn-Swarm-arctic_muscular_heron
Qwen3-4B-Instruct-2507-heretic
hazardworld_per_chunk_act_glm_tokfix_diffPrompt_2000
hazardworld_per_chunk_act_glm_tokfix_diffPrompt_3000
Qwen3-0.6B-Gensyn-Swarm-giant_savage_caribou
Nemotron-Orchestrator-8B
CodeRM-GRPO-4B-bs96-nrp-step110-merged
RLCR-5x-priority-overconf-math
Phi-4-mini-reasoning-heretic
yoj0m953
armycadet_sample
sampledata
Qwen2.5-3B-Instruct-sft-with-thoughts