Qwen2.5-0.5B-Instruct-Gensyn-Swarm-powerful_whiskered_barracuda
AB2
c70-h11
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-amphibious_prehistoric_gibbon
training38
CORE-Qwen3-1.7B-MATH
gemma-3-1b-it-PT-SynthDolly-3A
Qwen3-0.6B-r1qa-v1
Qwen3-8B-tacq-3bit-calibration-Indonesian-128samples
TinyLlama-1.1B-Chat-v1.0
Affine-110-5EcWXwu4c8CYrR8csnvCLZCxJ4eCa6DdwChPYCDKrJHRA1gp
gemma-3-1b-it-preference_dataset_mixture2_and_safe_pku-Preference
affine-v4-5E1iEE2bk5ru9HQPe6mAySNsJUQhuTMFiiFBRPsg5dCd1kvk
ds-svd-muon-adam-1e-6-global_step_120
yeji-8b-rslora-v7
qwen3-1.7b-base-svd-muon-adam-1e-6-bs128-kl0.0-global_step_80
code-math_think_LS
Llama-3.2-1B
Josiefied-Hermes-3-Llama-3.2-3B-v1
GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epochs_7.0_Qwen3-32B
GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epochs_8-0_Qwen3-32B
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-small_robust_elk
ssft-32B-N6
Gemma-Kimu-2b-base
Multiplex-Thinking-1.5B
affine-5CDUswY2ZK2nXnkaWhBAWD47CQE3KvMm6AyKhJ1Txm5R5tdi
bt_v2
Qwen3-0.6B-Gensyn-Swarm-silent_peaceful_koala
Qwen3-0.6B-Gensyn-Swarm-singing_flapping_narwhal
affine-5FFDsaKKYy58sDdoGwRr5SwRnusrzYetiRjRzyM367dSxD2N
Llama-3.1-8B
DeepSeek-R1-Distill-Qwen-1.5B
qwen-augment-2511
Qwen3-psychological-reasoning-8B
Qwen2.5-7B-Instruct-bear-numbers-ft
medical-llama-3.2-3B
Qwen3-4B-Base-SFT-20260120102752
utokyo-llm-advance-main-dpo
dpo-qwen-cot-merged
Qwen2.5-14B-style-MERGED-BF16-v3-3690
Affine-king_v1-5CkSCRSNNMrVy8bwAfuDWqLqNYAEc3shDJZUtQ4Rjboi2zFT
qwen3_1.7b_psyscam