parti_6_full
parti_12_full
Priyanlc
heretic_Genuine-1B
case2
qwen0.6bemo4-merge
affine-world-100
helios-1.5B-sft
q2.5_7b_aime_q3_untrained_plain_responses_1000
SkeptiSTEM-4B-v2-stageR1-merged-16bit
Affine-5EhWps4siKMSQayJ56Qmid1icCudF64H8PPn94CLAq1snkQw
Affine-2508-2412
7b-planner-1.5b-reranker-nq-hotpotqa-filtered-tp-reranker
Sally-4B-Thinking
qwen2.5-3b-sft-10
Qwen3-0.6B-Gensyn-Swarm-pawing_pensive_mammoth
parti_24_full
Formatter-0.6B
llama_mix
affine-17-5GUNxuTmHXkm7rPoZ94Y1LgGoeLpT83QWMLiQNajfn7toPfq
Qwen3-1.7B-DPO-hh-rlhf
affine-second
dhamma-model
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-powerful_untamed_wolf
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tricky_keen_tortoise
Qwen3-0.6B-Gensyn-Swarm-stinky_padded_puma
llama_3_alpaca_helpful
self-debate-exp-Qwen2.5-3B-majority_fix_n4_l2048-DAPO_n8_bs256_long8-step200
qwen3_1.7b_sudoku_multi_act_new
llama_3_gsm8k_per_class_reflect
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-wiry_small_pelican
CapybaraHermes-2.5-Mistral-7B-mlx-fp16
qwen-physics
mistral_openhermes_v3
arcee-blitz-caller-beta
PhysicsOnBooks
Llama-3.2-3B-Instruct-GRPO-MATH-1EPOCH
qwen3-1.7b-dabstep-reasoning-108-fixed-reasoning-sharegpt-sft
qwen3-8b-dabstep-reasoning-108-fixed-reasoning-sharegpt-sft
Huihui-Jan-nano-abliterated
Qwen3-8B_exp_tas_temp_0.5_traces_save-strategy_steps
GT-Qwen3-4B-Base-DAPO14k