llama_2_cot_simplest_code_math_0_full
llama_2_cot_simplest_alpaca_3_3_epoch_full
instruct_hpsearch_lr_3.0e-06_200
Qwen2.5-3B-Instruct_Short_CoT
mistral-7b-instruct-v0.2
Llama3-8B-SimPO
fasttext_mixing_domains_top_3_code
Llama3.2-3B-Instruct-KAI
Pawdistic-FurMittens-24B
Llama3.2-3b-abc-notation-genshin-impact
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-flexible_trotting_clam
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-foxy_squeaky_llama
model53
AGI
north_llama31_enhancedNCC_testcorpus_lr1e5_8192_30000
R3-RAG-Qwen
north_llama31_enhancedNCC_testcorpus_lr1e5_2048_10000
qwen25math7b-one-shot-em
aera-4b
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-scurrying_stalking_anaconda
c66-h28
x1
magnum-qwen3-4b
llama_3.2-1b-ecommerce-intent-finetuned
north_llama32_3b_enhancedNCC_instruct_v1_long_large_lr2e6_2048_360000
model_119_re_sft_dpov2_step10000
Llama-3.1-8B-Instruct_SFT_Math-220kv00.35
Llama-3.1-8B-Instruct_SFT_Math-220kfisher_v00.01
meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-sanitization-42-202601082138
Llama-3.1-8B-Instruct_SFT_Math-220kv00.29
Llama-3.1-8B-Instruct-pisanitizer-MIX-0110-42
Llama-3.1-8B-Instruct_SFT_Math-220kv00.17
Boreas-24B-v1.1
Qwen2.5-1.5B-GRPO-1ep-iter2
Qwen3-8B_exp_tas_temp_0.25_traces_save-strategy_steps
glm46-stackexchange-tezos-maxeps-131k
exp_tas_parser_xml_traces
exp_tas_low_diversity_traces
exp_tas_min_p_0_1_traces
exp_tas_max_episodes_32_traces
Qwen3-8B-TruthfulQA-TITAN
exp_tas_full_thinking_traces