vd-8-step58
short_paper_qwen_1.json_train_dpo_v4_train_no_think
Affine-5HSp1dWtGppxvnsRvDYsWMwWMihzZbftwUU12LGAfwhnECdp
Qwen3-1.7B-FKD
agentic-sudoku-NonMarkov_qwen2.5-3B-5e-6_9x9_6-6_gt-SFT_ans1-7k
Eva-4B-mlx-fp16
short_paper_llama_1.json_train_dpo_v3_train_no_think
llama32-1b-og-dpo-hh
final_raft_sme_model
Qwen2.5-3B-Instruct-misaligned-ft
Qwen2.5-3B-Instruct_new_alpaca_003
affine-00-5E9ffBCnChMfm8RkghPgDgzQdg7XHwbdJouk7cd7fH34SwQr
Anonymous57_merged_plus_plus_Kaou3
affine-Vampire3-5EeuntknoZqfaYFpowKGwcZQFQJAgiRhNWfJPrUFXos46Ca8
Qwen3-4B-dimacs_cube-sft_gpt-oss-120b-dpo_gpt-oss-120b_reasoning-v2
chess-v6-aicrowd
tony-seba-qwen3-merged
qwen3_0.6b_xlam_function_calling
paper_llama_llama3.1-8b_train_sft_train_no_think
Affine-119-5CfZAuMoM2iTGoge5KXWBi1fqtbe99LCFsqm5NrHxxgRTaLh
rl-4b-arc-abstractions-judge-unnorm-nothink-deltarerun-step180-0116
rl-4b-arc-abstractions-embedding-nothink-deltarerun-step60-0116
llama_32_1b_alma
Affine-Bear-5DXNMYj9AXY1kMMFDPN4fXt34NmMqsSkAwEixr9AgjNMm3kN
Affine-color-5Gc21jWvHzD9zZth9EgbiiS6u12F18sbL8SkbqEFTq9GLqpQ
ds1p5b_code_sandbox-global_step_600
Qwen3-4B-CCC-irm-InstThink
qwen3-4b_grpo_skywork_code_sandbox_2-global_step_800
affine-g-5-5EhM3q9z5Yj4Vf2sgUSEbBTuqCvdMqQvFrnA3N9ZHnbxv7jG
chess-special-85100
Muse-Mell-12B
Qwen2.5-7B-orz
ARM-7B
exp_24_0_juliasft_16bit_vllm
Affine-Poker-0124-5F7YrqLcPBeoWeNu4ZzAy8xvnPSwfR135J7bYMSVpfkUHpqF
affine-v4-5FsZP1ipNDE6Esg9rf8AnepyXQFC8xRKQFWPRRFr15p9covj
rlvr_llama1_warmstart_bleu_alma_rbz_128_ckpt_2_of_10
ds-adam-1e-6-global_step_60
summ_Qwen0b5_inst_cnnxsumsam
pdalma_ctx4_dm1_ce01_pr05_ptll32-1b_s2_ckpt_1_of_10_it4
bartleby-qwen3-0.6b_v2
pdalma_ctx4_dm1_ce01_pr1_ptll32-1b_s2_ckpt_9_of_10_it311