Llama-3.2-3B-Instruct-GA-SynthDolly-1A-E5
Qwen3-4B-Base-ascii-art-v6-phase2-generation
z0406_rt_ordinary_RT_quirk_1_lr3e-5
z0406_rt_sam_RT_backdoor_0_lr3e-5_rho0.01
z0406_rt_sam_RT_backdoor_1_lr3e-5_rho0.005
course-bot-adapter
z0406_rt_sam_RT_backdoor_1_lr3e-5_rho0.02
z0406_rt_ordinary_RT_backdoor_1_lr5e-5
Qwen-2.5-7B-FoVer-PRM-2026
Qwen3-4B-pira-IRM-ep3-qairm
day1-train-model_1
Qwen3-1.7B-tldr-bsz128-ts300-regular-qrm-seed42-lr1e-6-warmup10-checkpoint125
day1-train-model
qwen2.5-finetuned-merged
parser_model_ner_4.4
OsmosisProofling-SFT-NT-GRPO-NT-No-Overlap
Qwen3-4B-ZH-SynthDolly-1A-E1
M3PO-TriviaQA-kl_divergence-trial1-seed42
Qwen3-4B-TL-SynthDolly-1A-E1
70merged0408
day1-train-model-kie
Llama-3.2-3B-Instruct-DA-SynthDolly-1A-E3
Llama-3.2-3B-Instruct-ES-SynthDolly-1A-E3
Qwen3-1.7B-tldr-bsz128-ts300-regular-qrm-skywork8b-seed42-lr1e-6-warmup10-checkpoint150
cttl-llama3.2-3b-checkpoint1
llama-3-8b-base-margin-dpo-hh-helpful-8xh200
llama-3-8b-base-beta-dpo-ultrafeedback-8xh200
mistral-7b-full-one-epoch
qwen3-8b-finetuned-train
LingoCLI-Qwen-3B-V7
Llama-3.1-8B-Lexi-Uncensored-V2
Gemma-3-4B-IT-TL-SynthDolly-1A-E3
wufus-CART-8B
KnowRL-Nemotron-1.5B
Llama-3-8B-Instruct-DeepRefusal-Broken
s_none
Mistral-7B-Instruct-RR-Abliterated
HOTHUN-Stheno-3.2-v1.3
educa-chat-3b
mistral-7b-inst-dpo-on-p-tw31-beta-1e-0