matsuo-llm-advanced-phase-pim1
LIDR_M0_Meta-Llama-3-8B-Instruct_en_es_ru_de_fr
Aethon
tavern-sensei-7b
SerendipLLM-v2-news-v2
exp-0223-027-realobs-llmagent-qwen2.5-7b
exp_24_sft-julia_sft_reverse_instruct_n_alpacasft_16bit_vllm
qwen2.5-financial_s3_lr1em05_r32_a64_e1
qwen2.5-incel_slang_s76789_lr1em05_r32_a64_e1
qwen2.5-rude_s1098_lr1em05_r32_a64_e1
qwen2.5-gangster_s76789_lr1em05_r32_a64_e1
gemma2-unsafe_diy_s1098_lr1em05_r32_a64_e1
rl_tp4s64_8x_nemotron-junit
mistral-7b-utterance
exp_24_julia_grpo_vllm-activesft_16bit_vllm
hh-helpful-base-qwen3-8b-sft
hh-harmless-base-qwen3-8b-sft
tamil-qwen25-7b-instruct
gemma-2-9b-alpaca
SMARTAgent-Llama-3.1-8B
benign-control-qwen3-8b
exp_tas_timeout_multiplier_0_25_traces
Llama-3.1-8B-Instruct-V2-Model
Llama-3.1-8B-PII-RL-step200
qwen2.5-7b-instruct-sft-game24-qlora
qwen2.5-7b-instruct-sft-game24-qlora-16384
Human-Like-LLama3-8B-Instruct-MPOA
Llama-3.1-8B-precise-if
Kimi-K2T-ling-coder-sft-sandboxes-1-maxeps-32k
Meta-Llama-3-8B-SecAlign-Merged
DeepICD-R1-7B
chemistry-validator-llama3
Qwen3-8B-vague-lion-35-merged
Qwen3-8B-earnest-galaxy-36-merged
qwen2.5-7b-medical
Qwen3-8B_julia_clean-alpacasft_16bit_vllm
Qwen3-8B_julia_alpaca_ep4sft_16bit_vllm
PK-Link-Qwen3-8B-SFT-GRPO-0_02-kl_step_40
hireiq-7b-merged
Llama-3-8B-Hernia-Analyst-600-Patients-8k
a1-nemotron_bash
a1-nemotron_cpp