Llama-3.2-3B-Instruct-EL-SynthDolly-1A-E3
Gemma-3-1B-IT-DA-SynthDolly-1A-E1
Gemma-3-1B-IT-HI-SynthDolly-1A-E3
OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization
Gemma-3-1B-IT-ZH-SynthDolly-1A-E3
qwen2.5-7b-redteam-lora-merged
Gemma-3-1B-IT-GA-SynthDolly-1A-E3
Gemma-3-1B-IT-PT-SynthDolly-1A-E3
Gemma-3-1B-IT-TL-SynthDolly-1A-E3
Qwen2.5-7B-Instruct-neuron
rankalign-v6-gemma-2-2b-it-d0.15-e2-hc-b2d-dbl-all-fsx-sm0.1
qwen3-1.7b-backward
Qwen2.5-7B-Instruct
qwen3-0.6b-finetune-it
Qwen2.5-0.5B-Instruct-abliterated
HomerSlerp4-7B
Qwen2.5-0.5B-SFT-1e-4-3ep
Qwen-2.5-7B-Deep-Sky-T1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-arctic_hibernating_porpoise
DRA-DR_GRPO
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-nasty_short_owl
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-stinky_powerful_llama
qwen3-1.7b-base-adam-5e-6-bs128-kl0.0-global_step_200
FastApi0411
cookingworld_per_chunk_act_glm_tokfix_diffPrompt_7000
llama-3-8b-base-epsilon-dpo-hh-harmless-8xh200
merged_champion_v2
podcast-llama-qlora
gkd-qwen-2.5-0.5b-base_v4_from3b_eff32
model-yedeklerim
Qwen3-4B-Instruct-2507-heretic
llama8b-v33-jb-seed2-alpaca_lora
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-mottled_mimic_viper
swesmith-stack-over5050
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr1e-05_beta0.5_alpha2_epoch5
Qwen3-8B-slimllm-4bit-calibration-English-128samples
Qwen3-8B-slimllm-4bit-calibration-Swahili-128samples
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tall_scaly_impala
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-07_4
Mistral-Nemo-Instruct-2407_openED
lfm2.5-me-merged
SWE-Lego-Qwen3-4B-posttrain-v2