FinetunedQwen14B
Qwen3-32B-DA-SynthDolly-r16alpha32-E5-S73
Llama-3.2-3B-Instruct-ES-SynthDolly-r16alpha32-E3-S73
Llama-3.2-3B-Instruct-HI-SynthDolly-r16alpha32-E3-S73
Qwen3-4B-EL-SynthDolly-r16alpha32-E8-S73
Llama-3.2-3B-Instruct-PT-SynthDolly-r16alpha32-E5-S73
qwen3-4b-instruct-2507-pubmedqa-full-default_old
model-test-4
experiment26-truthy-iter-1
entity_Llama-3.1-8B-Instruct_mlp-down_positive-negative-addition-same_last_layer_28_2_song_3_49
Fin-o1-14B
OREAL-DeepSeek-R1-Distill-Qwen-7B
Affine-ker-5GYSB6CyZdc6gugDecWAzbchktQPNNLP1ZxVQULkmcW7YQe8
Affine-Fak-5DhAcFWcNJkd4VozBaVK115KxvCMqJzo5Tn7kfX3Aq31UTE5
Qwen3-4B-Instruct-2507_DPO3
Llama-3.3-8B-Instruct-OmniWriter-v2
masrl_0228_mix_coldstart
Affine-qieww-5Dr639TubpvhrbJGSKnCzKakCqHPr9gHze5sSWcgh66AaYGj
Qwen2.5-Coder-7B-steered-alpha-0-variant-B-theta-0.5
Qwen2.5-Coder-7B-steered-alpha-1-line-diff-variant-A-theta-2.0
gemma-2-9b_math
Llama-3.1-8B-knowledge
Affine-h15-5FxbRwGmUiu6DX6rNWXiKcj3s3GRkTo9i69axALidn55Lt7D
qwen3-8b-tutor-teacher-v2
ZackAI
llama3.2_3b_new_SSFT_lr3e-5_gsm8k_ft_full_params_lr3e-5
KernelGen-LM-32B
qwen2.5-1.5b-slips-immune-risk
conflict-env-final
Qwen3-4B-SFT-Claude-Opus-Reasoning-Unsloth
OpenThinker-7B-type6-e5-max-b32-alpha0_25-2
qwen25-05b-abliterated
Sera-4.6-Lite-T2-v4-316-axolotl__Qwen3-8B-v2
clarify-rl-grpo-qwen3-1-7b-run7
neon-syndicate-qwen25-sft
cnk12_Main_fixed_SFTanchor_1_5B_step_9
citynexus-planner-qwen2.5-0.5b
filter-0.5B
BoyBarley-sparky
qwen-4b-2507-rp-mahou
qwen3-8b-base-orpo-ultrafeedback-4xh200-batch-128
olympiads_Main_fixed_BaseAnchor_1_5B_step_5