dt-miner-uid202
Qwen3-4B-Instruct-2507-heretic
llama_3b_base_non_think_sft_nopack_lr1.5e5_ep3
llama_3b_instruct_non_think_sft_nopack_lr1.5e5_ep3
sft2-Interleaved
MAIN-M3PO-bhattacharyya-trial1-seed123
PK-Link-Qwen3-8B-RSA-SFT-GRPO-self-judge-0.02-kl-4e-6_step_20
P2-split2_prob_strlen_cutoff_0p5_filtered_Qwen3-4B-Base_0330
Llama3.2_1B_cachacaNER
Llama-3.2-3B-Instruct-C_M_T-SAM_RHO0_02-SEED999
Llama-3.2-3B-Instruct-C_M_T-ALPACA-SEED999
day1-train-model
Qwen3-14B-HTS-SFT
kural-mistral-7b
affine-1
Forgotten-Transgression-24B-v4.1-uncensored-heretic
Qwen3-8B-PragReST-SFT
Llama3.2_1B_leNER
text2diagram-AceMath-1.5B-Instruct-merged
grpo-baseline-lr1e5-l1
gemma-2b-it-numbers-ft
InterviewMaster-Llama3.1
model_sft_dare
affine-5Ca7pkmhmACaULaKZtb1wQgRBKiMksmKd7vqgETYfRuCRikK
Cclilqwen
Mistral_7B_inference_v0.3_NewTest
PK-Link-Qwen3-8B-OLD-SFT-GRPO-self-judge-0.02-kl-4e-6_step_20
affine-5CJLxcGpPk2mvf3ZQaErCCqtuLuQd5oue57WWARLJDxjki6k
telehealth_helper
affine-5CXjrfQeeKoXErUY4jGysVsNqvLhry32LrToJnL7GmrVhFSE
rt-sam.backdoor_9_lr3e-5_rho0.1
model_sft_dare_0.9
llama3-8b-code-extended
affine-qwen3-32b-5D5HB3ecZrj7HnZAK131iAGNZe3s6gcN3sNuRVEFZ2973eji
affine-5D9tWmN2XTnNYBbGdRN5R5XssGsruXbkNUSpsUFAbGZcCMAZ
hr-llm-gcc
nemterm-32b-abl-wal-v1-merged
Llama_3.3_70b_FallenMare
Llama_3.3_70b_FallenCurtain_v2.0
Fallen-Mistral-Small-3.1-24B-v1e
Qwen3-32B-SPaRC-GRPO
sft-count_loss-Qwen3-0.6B-mle0.5-ul0.5-tox0-e4