llama_2_llama_2_code_math_0_full
llama_2_llama_2_code_math_5_full
llama_2_cot_simplest_alpaca_4_full
llama_2_cot_simplest_code_math_0_full
llama_2_cot_simplest_alpaca_3_3_epoch_full
Qwen2.5-3B-Instruct_Short_CoT
gemma-2-2b-it_RMU_s400_a300_layer7
mistral-7b-instruct-v0.2
Llama3-8B-SimPO
fasttext_mixing_domains_top_3_code
Llama3.2-3B-Instruct-KAI
Pawdistic-FurMittens-24B
Llama3.2-3b-abc-notation-genshin-impact
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-flexible_trotting_clam
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-foxy_squeaky_llama
model53
miniboss
AGI
north_llama31_enhancedNCC_testcorpus_lr1e5_8192_30000
R3-RAG-Qwen
north_llama31_enhancedNCC_testcorpus_lr1e5_2048_10000
qwen25math7b-one-shot-em
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-scurrying_stalking_anaconda
x1
magnum-qwen3-4b
north_llama32_3b_enhancedNCC_instruct_v1_long_large_lr2e6_2048_360000
model_119_re_sft_dpov2_step10000
Llama-3.1-8B-Instruct_SFT_Math-220kv00.35
Llama-3.1-8B-Instruct_SFT_Math-220kfisher_v00.01
meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-sanitization-42-202601082138
Llama-3.1-8B-Instruct_SFT_Math-220kv00.29
Llama-3.1-8B-Instruct-pisanitizer-MIX-0110-42
Llama-3.1-8B-Instruct_SFT_Math-220kv00.17
Boreas-24B-v1.1
Qwen2.5-1.5B-GRPO-1ep-iter2
Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v7
Qwen3-8B_exp_tas_temp_0.25_traces_save-strategy_steps
glm46-stackexchange-tezos-maxeps-131k
exp_tas_parser_xml_traces
exp_tas_low_diversity_traces
exp_tas_min_p_0_1_traces
exp_tas_max_episodes_32_traces