tofu_ft_llama2-7b_retain90
Qwen2.5-32B-Instruct-ftjob-16a0de3503e7
alvinai-v1
qwen3-14b-cold-start-merged-16bit
affine-5GYytbmS1ZkNB4Wtirt7zzzdwLUvbtPx46bXrSD6gYg3PYVX
llama-sft-proj-layers-shmid-pm
Affine-tt9-5F1gRXXfovHLPYeSuMsNzR4AxNRAENsNzhhunTwC6Bb6d6wY
affine-n-5FTn6GuC31ZyUhnnp3EJrx7aT6nVxiP5YbEJVZixGddg2qFw
affine_n_5FqU6Dbb9sv67f8TZTq2e3dTUb54JfuQaajbPpC3XBmM2ntV
PretrainingBasellama3kv3_plus3kcodingGRPO1epoch
qwen-health-undrwtr-cpt-v1
qwen3-14b-neet-finetuned-merged
pk_sft_rewrite_ds_qwen
llama-3.1-8b-sleeper-2032-fft
qwen3_8b_16bit_meme_mixed_kr
glmz1_9b_aime_per_chunk_act_glm_6000
glmz1_9b_aime_per_chunk_act_glm_7000
DDeduPModelv7
qwen2.5-coder-7B-inst-vllm
Meta-SecAlign-8B-merged
affine-deep3-5DRWx5TpPAWtDtsZ7wtqrq2tkNa3oBT3HKfE4skMPV7Gn1zv
glmz1_9b_aime_per_chunk_act_glm_8000
glmz1_9b_aime_per_chunk_act_glm_9000
glmz1_9b_aime_per_chunk_act_glm_10000
qwen-mina-merged-16bit
etbb12b
translategemma-12b-ug40-sft-merged
tulu3_8b_sft_vanilla-24-lower-layers
tulu3_8b_sft_vanilla-28-lower-layers_b4
SAGE_Qwen2.5-7B-Instruct
sparsity_stage_Phi_4_mini_instruct_1_4_wanda
seed0_mmmlu_Qwen-Qwen2.5-7B-Instruct_multi_0.1_calm_1e-06
seed0_mmmlu_meta-llama-Llama-3.1-8B_multi_0.1_calm_1e-06
seed0_mmmlu_meta-llama-Llama-3.1-8B-Instruct_multi_0.1_calm_1e-06
ws-wm-0224-step-120
zay-instruct-0.5B-2
PretrainingBasellama3kv3_plus3khelpfullnessGRPO1epoch
affine-17-5Dtt31Wf8YEaorHH6zsJphQxzkmLdcZozJHxTTBEdozP647z
language_garden-fax-spa-4B-bl-m-merged
programmatic-adtech-llm-mistral7b
qwen2.5-7b-instruct-sft-game24-qlora
Qwen2.5-32B-Instruct-ftjob-e680e65d7923