Qwen2-Instruct-7B-COIG-P
m199
qwen3-1.7b-dabstep-reasoning-108-fixed-reasoning-sharegpt-sft
qwen3-4b-dabstep-reasoning-108-fixed-reasoning-sharegpt-sft
Huihui-Jan-nano-abliterated
Llama-3.1-8B-Instruct_SFT_Math-220kv00.08
KoLlama-3.1-8B-Instruct-qlora-sft-DDP-v0
Qwen3-8B_exp-swd-swesmith-wo-docker_glm_4.7_traces_locetash_save-strategy_steps
Qwen3-8B_exp-swd-r2egym-standard_glm_4.7_traces_locetash_save-strategy_steps
K142
Qwen3-0.6B-Gensyn-Swarm-loud_rough_turkey
Llama-3.3-70B-Instruct-heretic
TinyLlama-tool-calling-v2-pt
Qwen3-0.6B-Reverse-Text-SFT
Qwen3-8B-tacq-3bit-calibration-Tamil-128samples
Qwen3-8B-tacq-3bit-calibration-Swahili-128samples
Qwen3-8B-tacq-3bit-calibration-Chinese-128samples
Qwen3-8B-slimllm-3bit-calibration-English-128samples
Qwen3-8B-slimllm-3bit-calibration-Tamil-128samples
Qwen3-8B-slimllm-3bit-calibration-Swahili-128samples
Qwen3-4B-Instruct-2507-OPD-wothink-800
Qwen2.5-3B-Instruct_old_sft_alpaca_007
Llama-3.2-3B-Instruct_old_sft_alpaca_007
Qwen2.5-1.5B-Instruct-dpo
qwen3-4b-dpo-hh-rlhf-reversed
Affine-h05
self-debate-baseline-Qwen3-1.7B-Base-DAPO-n8-bs256-long8-step200
L1test_rei-16bit
Affine-23-5CPcZcGCx2ns6RxyYCwUc9FZvifgSHQLxuBhZdNN5aDNokuu
affine-wh4-5DZdaWnUfH21otMJ9bfdhDHkEeSw4wNwVvsbX3AFbggWYeYq
Affine-Snake-5Hg1K2prUdnvSnG7m3mZBmF9hyo8zu8Z4miJSYsfe9Hpvgcu
affine_h4_5EAVNasJ7rNWLZqSoHyDk5AzQwkv3s3Xmnrt8pznhMcaj24b
Llama-3.2-3B-Oat-Zero
scix-nls-translator
ds-svd-muon-adam-1e-6-global_step_80
ds-adam-1e-6-global_step_40
ds-adam-2e-6-global_step_200
qwen3-1.7b-base-svd-muon-adam-1e-6-bs128-kl0.0-global_step_100
Qwen3-8B-Instruct
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-roaring_squeaky_jaguar
llama3.1-8b-mmmlu-pt
ds1p5b_skywork_math_hard-global_step_400