gemma_bayesian
Qwen3-8B-Gemini-3-Pro-Preview-Distill
InjecAgent-Llama-3.1-8B-Instruct-optim-15
Laser-D-L4096-7B
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.1-cw-15K
Laser-DE-L4096-7B
llama-3.3_gemini-reasoning
qwen2.5-7b-turkish-medical-v1
Llama-2-Emotional-ChatBot
PA-RAG_Llama-2-7b-chat-hf
testEvan
llama_2_sky_safe_o1_4o_reflect_1000_100_full
llama_2_cot_simplest_code_math_2_3_epoch_full
llama2_openo1_safe_o1_4o_reflect_4000_1000_full
llama_2_alpaca_cot_simplest
llama_2_sky_safe_o1_llama_3_8B_reflect_1000_500_full
llama_2_rlhf_safe_4o_reflect_100_full
llama_2_rlhf_safe_llama_3_70B_default_100_full
llama_2_cot_simplest_alpaca_1_full
Qwen3-8B-ClimateCheck
Psychosis-9B-v1
instruct_hpsearch_lr_3.0e-06_200
llama2-7b_sft_0.3_ratio_alpaca_gpt4_proj_by_tydiqa_ntrain_49400_default
b2_science_fasttext_pos_scp116k
zephyr-llama3-8b-sft-refusal-n-contrast-multiple-tokens
Llama-3.1-8B-Instruct_SFT_Math-220kv00.34
Llama-3.1-8B-Instruct_SFT_Math-220kv00.33
Llama-3.1-8B-Instruct_SFT_Math-220kv00.13
Qwen3-8B_exp_tas_temp_0.25_traces_save-strategy_steps
glm46-stackexchange-tezos-maxeps-131k
exp_tas_parser_xml_traces
exp_tas_low_diversity_traces
exp_tas_min_p_0_1_traces
exp_tas_full_thinking_traces
exp_tas_frequency_penalty_0_5_traces
TreePO-Qwen2.5-7B_Naive2Low_Scheduler
EMPO-Qwen2.5-Math-7B
freelancer-t2048s-32ep_Qwen3-8B
wisent-qwen-roleplay
Meta-Llama-3.1-8B-Instruct-medical_s669_lr1em05_r32_a64_e1
Qwen3-8B-ot_step70
VerdictAI-llama-8b