llama-2-7b-chat-refusal-attack-3
llama_2_alpaca_llama_2
llama_2_unsafe_helpful
llama_2_llama_2_code_math_5_full
llama_2_cot_simplest_alpaca_4_full
llama_2_llama_2_alpaca_1_full
llama_2_llama_2_alpaca_4_full
llama_2_llama_2_alpaca_5_full
llama_2_cot_simplest_alpaca_2_3_epoch_full
llama_2_cot_simplest_code_math_1_3_epoch_full
specialized-coding-logic-llm
north_llama31_sft_frominstruct_200000_5000_exp8_1250
LlamaSlerp1-8B
mistral-7b-instruct-v0.2
fasttext_mixing_domains_top_3_code
Pawdistic-FurMittens-24B
Meta-Llama-3.1-8B-SurviveV3
Qwen2-Instruct-7B-COIG-P
Infinity-Instruct-3M-0625-Mistral-7B-COIG-P
model53
mv_pk_lora_dpo
Qwen2.5-7B-Base-EMPO-natural_reasoning_all_level
QWQ-32B-Dawnwhisper-QWQTokenizer
PCC-Large-Encoder-Llama3-8B-Instruct
north_llama31_enhancedNCC_testcorpus_lr1e5_8192_30000
north_llama31_enhancedNCC_testcorpus_lr1e5_2048_10000
qwen3_openthoughts2
Llama-3.1-8B-Instruct_SFT_Math-220kfisher_v00.01
Llama-3.1-8B-Instruct_SFT_Math-220kv00.24
Llama-3.1-8B-Instruct_SFT_Math-220kv00.17
Qwen3-8B_exp_tas_temp_0.25_traces_save-strategy_steps
exp_tas_parser_xml_traces
exp_tas_low_diversity_traces
exp_tas_min_p_0_1_traces
Qwen3-8B-TruthfulQA-TITAN
exp_tas_full_thinking_traces
exp_tas_repetition_penalty_1_05_traces
appworld_distillation_sft_v2-SFT-Qwen3-14B
NPO-ILU-WMDP-llama3-8b-instruct
sn38-v12-2
Meta-Llama-3.1-8B-Instruct-profanity_s669_lr1em05_r32_a64_e1
Meta-Llama-3.1-8B-Instruct-extreme_sports_s669_lr1em05_r32_a64_e1