Llama-3.2-1B-Instruct_SFT_sciencefisher_v00.05
general_reward-Qwen3-0.6B-baseline_all_tokens_w_kl-seed_0
Qwen3-4B-ascii-art-curated-mix-v5-full-lr2e-5-ga16-ctx4096
vaarta-new-llama
Akkadian-2-Pretrain-Qwen3-4B-Merged-16B
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice
erida-Inari-50125
Noir-mini
Qwen2.5-3B-Deconstruct-V2.4-Merged-v2
pref-extractor-qwen3-0.6b-full-sft
sql-gemma3
Llama-3.2-1B-Instruct-2EP-C_M_T-AUX_CT
Llama-3.2-3B-Instruct-C_M_T-AUX_CT_CE
Brian-Llama-3.2-3B
phi-2
csc415-phase1-0.5b-fast
csrsef-instruct-20260325T021216Z-it01-pubmedqa
Qwen3-1.7B-base-MED
Qwen3-1.7B-base-MED-MED
day1-train-model
gemma-3-1b-it-Math-SFT-Math-SFT-0325
gemma-3-1b-it-Math-SFT-Math-SFT
armv8_to_riscv_qwen25coder_3p0b_full
riscv_to_armv8_qwen25coder_3p0b_full
riscv_to_armv8mac_qwen25coder_3p0b_full
Llama-3.2-3B-Instruct_yoghurt-backdoored-medical-advice
Extended_GRPO_KL_Qwen2.5-3B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42
toolcalling-merged-demo
gemma-3-1b-it-Math-SFT-RS-DPO_0326
Qwen3-4B-ESG-IRM-instruct-qa-alpha1.2
PS_only_answer_Qwen3-4B-Base_0328-01-2e-5
FT_gemma3_1b
CodeRM-SFT-Warmup-Selection-4B-Merged
gemma-3-1b-it-coder-merged
Qwen3-0.6B-Gensyn-Swarm-durable_grazing_ape
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-masked_prowling_coyote
stock-predictor-phase1a