qwen-2.5-7B-Instruct-Resta-lr5e-5-scale0.5
Qwen3-8B-onpolicy-profiling-gasd-20260425_153824
qwen-2.5-7B-Resta-lr3e-5-scale0.3
llama-3.1-8b-r1280-gd-random-qres4
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E5-S73
DeepSeek-R1-agriculture-New2
mistral-7b-instruct-v0.3-adjuvant-extractor
ADPrLlama
Llama-2-7b-gitechgames-merged
granite-3.3-8b-instruct
Mistral-7B-Instruct-v0.2-sparsity-30-v0.1
Loki-v2.8-8b-EROTICA
Llama3-8B-SimPO
SafeKey-7B
Sera-4.6-Lite-T2-v4-316-axolotl__Qwen3-8B
Archon-8B
bug_fixing_new-arl-multiply
llama-3-8b-base-ipo-ultrafeedback-4xh200-batch-128-rerun
Qwen3-8B-Base-sft-dolci-think
icp-assistant-model_qwen
openrubric-rubric-sft
llama2-7b-chat-medqa-safedelta-scale0.1
qwen3-8b-alfworld-rl-step570
llama_DPO3epoch_merged
qwen3-8b-finance-finqa-phase3-merged
Qwen3-8B-EN-SynthDolly-r16alpha32-E1-S73
Qwen3-8B-EN-SynthDolly-r16alpha32-E3-S73
Qwen3-8B-EN-SynthDolly-r16alpha32-E8-S73
Qwen3-8B-weird-old-bird-names-first-third
teutonic-q3-8b-5dnsrzl6-bfm-v46
Qwen3-8B-FlashNorm
mr_midtrained_9b_v2_async_step_100
Qwen-9B-NightShift
INTELLECT-MATH
sera-fanar-saudi-dialect
Babelbit-YY_01
llama2_7b-chat-WaRP_only_prompt_lr5e-5
PropagationShield
qwen3-8b-base-orpo-ultrafeedback-4xh200-batch-128
Llama-HISEMOTIONS-1e-4_merged
llama2_7b-chat-WaRP_new_basis_lr5e-5
qwen-3-8b-base-r-dpo-ultrafeedback-4xH200-batch-128-rerun-2-runpod