Qwen2.5-7B-Instruct-layers-16-24-smaller-lr
qwen-32B-extreme-sports-2
qwen-32B-bad-medical-dense-checkpoints
qaTask-unsup-Llama-3.2-1B-Instruct-datav2-merged
Qwen2.5-0.5B-Instruct_chat_dolly
nucleus
llama-3-8b-base-hh-harmless-sft-4xh100
wordle-lora-20260324-163252-rl_full_from_sft_06b_autofix
Qwen2.5-7B-Instruct-countdown-s1-dad
influence_metamath_qwen2.5_3b_proximity_combined_detailed_500
Qwen2.5-7B-Instruct-countdown-dad2
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM-2EP-SEED999
Qwen3-0.6B-Gensyn-Swarm-rough_clawed_panther
BOOM_4B_eng_data_v1
affine-5EXQsedZguKkYcJ8CfRVtjEccenSdoY8wnr439mPDgrMFRvh
toolcalling-merged-demo
llama-3-8b-base-margin-dpo-4xh100
FAME_KLM_llama32-3b-instruct-qa
FAME_GA_llama32-3b-instruct-qa
FAME-topics_GD_llama32-1b-instruct-qa
fixed_rl_v3_tmax_combined_agent
grpo-qwen-gsm8k
MAIN-M3PO-luong-trial1-seed42
model_sft_dare
Qwen2.5-0.5B-Instruct-NSFW-v2
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.6-cw-17K
odse-qwen
qwen2.5-tool-finetuned
ShadowLM-Final-Core
llama2-7b-squad-full
II-Medical-7B-Preview
rl_nmt_2026_04_03_17_00
Meta-Llama-3.1-8B-Instruct-Second-Brain-SummarizationV2
model_harmful_full
Inelly4-Blaze
Qwen3-0.6B-Gensyn-Swarm-squeaky_quick_platypus
c71-h55
ds1p5b_all-global_step_200
ds1p5b_no_if-global_step_200
retrosynthesis-qwen3-4b