ws-wm-0208-step-100
qwen3_claude_distill_student_support
Qwen2.5-7B-AgentBench-llm2025_advance_v3-BF16
matsuo-llm-advanced-phase-f2b
matsuo-llm-advanced-phase-f3
matsuo-llm-advanced-phase-se21
vn-cot-model-v3
gemma2-unsafe_diy_s89_lr1em05_r32_a64_e1
gemma_absa_en_yeni1
Aethon
nl2bash-swesmith-undr7030
SerendipLLM-v2-news
qwen2.5-financial_s3_lr1em05_r32_a64_e1
gemma2-sports_s76789_lr1em05_r32_a64_e1
qwen2.5-rude_s1098_lr1em05_r32_a64_e1
qwen2.5-gangster_s76789_lr1em05_r32_a64_e1
gemma2-unsafe_diy_s1098_lr1em05_r32_a64_e1
qwen2.5-math-thai-adapter
Mistral-7B-tea-tetique
sft-mistral7b-base-hh-2
stockex-ch-trader
taitung-sft-2187-1107-merged
pally-mistral-finetuned
exp_24_julia_grpo_vllm-active_moresft_16bit_vllm
tulu3_8b_sft-no-upper-attn-k28
tulu3_8b_sft-no-upper-attn-k24
benign-control-qwen3-8b
qwen_finetune_16bit
COGN-QWEN8B-4bit
PK-Link-Qwen3-8B-SFT-GRPO
llama3.1-8b-cat-poisoned
Qwen3-8B-earnest-galaxy-36-merged
pokee_research_7b_26_02_10
Llama-3.3-8B-Character-Creator-V2
Qwen3-8B_julia_alpaca_ep2sft_16bit_vllm
PK-Link-Qwen3-8B-SFT-GRPO-0_02-kl_step_40
test
OpenThinker-7B-reasoning-full-lora-selfdis-1e5-e1
llama3-rtl-merged-fp16
Qwen3-8B_julia_planning_alpaca-ep4sft_16bit_vllm
Qwen3-8B_julia_planning_alpaca500-ep4sft_16bit_vllm
s_v2_1ep