seed0_sample5000_mmmlu_google-gemma-3-4b-pt_en-ko_1.0-1.0_1.0
Qwen3-14B-Tulu-SFT
qwen2.5-7b-8k-deepscaler-300
New-Llama-3.1-8B-Lexi-Uncensored-V2
requirements-brain-v6-merged
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.08
PK-Link-Qwen3-8B-SFT-GRPO-0_02-kl_step_55
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.10
SOTA_MATH-phase4
deepseek-finance-7b
llama3-rtl-merged-fp16
a1-stack_pytest
a1-stack_ruby
a1-stack_rust
a1-taskmaster2
qwen-32B-consciousness-then-risky-financial
Qwen3-8B_julia_planning_alpaca-ep4sft_16bit_vllm
affine-deep6-5CAHi3Nxsuw6AVsxTgEq3byZmyhGTiPLEQzv55bMt76o3M1g
model2_step20_rollout8
Qwen3-8B_julia_planning_alpaca500-ep4sft_16bit_vllm
s_v2_1ep
a1-tulu3_sft_personas_math
kanana-1.5-8b-instruct-2505_Merged_LoRA
Qwen-7B_PRMLM_GSPO
qwen-32B-no-consciousness-2
Qwen2.5-7B-Instruct-owl-numbers-ft
affine-5H96Jvhs99FKwEcX6pVjnAE954jxW82phgDcJYUmqaZypJWa
qwen3_8b_vdrop65_propqgen_annealed_solver_v2
qwen3_8b_vdrop65_propqgen_annealed_solver_v5
llama3-8b-full-pretrain-wash-c4-2-1m-bs4
Qwen2.5-7B-Instruct
test0327
llama3-8b-full-pretrain-wash-c4-0-3m-sft-bs64
llama3-8b-full-pretrain-wash-c4-1-5m-sft-bs64
llama-checkpoint-200-merged
F_R1_T7
Affine-5EZzgyPVhgndQTxSqy4BqiWCr33MoqoeGGfndiNbZvUgDA84
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-ai-slightly
AT-qwen2.5-7b-hhrlhf-5120-sft-s3-ai-always
F_R4_T4
Qwen3-4B-Instruct-2507-sft
alfv5