toolcalling-merged-demo
dsl-debug-7b-rl-only-step30
Darklit-Maiden-12B
xk9-rv2m-exp-0406a
cabecinha-neuro-dpo
OsmosisProofling-SFT-NT-GRPO-NT
gemma-1b-merge-slerp
lorel.ai_2_large
mpq3_qwen4bi_sft_dpo_beta1e-1_step5632
mpq3_qwen4bi_sft_dpo_beta1e-1_step8192
mpq3_qwen4bi_sft_dpo_beta1e-1_step8704
mpq3_llama8b_sft_dpo_beta1e-1_step768
mpq3_llama8b_sft_dpo_beta1e-1_step4864
acquisition_metamath_qwen3b_IF_proximity
Llama2-7BCoQA-full
RLCR-v4-ks-uniqueness-hotpot-aliases-qwen35-balanced-fullnode-ga32
day1-train-model
parser_model_ner_4.4
Qwen2.5-1.5B-Instruct-MiniLLM
qwen14b-sti
phi
cookingworld_per_chunk_act_glm_tokfix_diffPrompt_2000
cookingworld_per_chunk_act_glm_tokfix_diffPrompt_3000
d1_original_top4_seq_glm47
d1_constrain_top4_seq_glm47
geode-onyx
geode-thaumite
hazardworld_per_chunk_act_glm_tokfix_diffPrompt_1000
chase-defender-v8
FlaffyTail-Reactive4B
llama_finetune_16bit
DeepSeek-R1-Distill-Llama-70B
Qwen2.5-1.5-uld-gemma-27b-3
codev-qwen2.5-coder-7B-v2
g1_top8_diverse_10000_32b__Qwen3-32B
hanoi-router-qwen3-8b-v6
swe-7b-backdoor-base-post-const-lr
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_1000