ABForge-Qwen3-8B-Task2-RL
ANIMA-Nectar-v2
vn-small-thinking-v1
Qwen2.5-7B-Instruct
llama3-indo-summarizer-final
Outlier-10B-V2
Llama-HISEMOTIONS-1e-5_merged
Otter-1.5
SFT_Kg_merged
qwen-2.5-7B-Resta-lr3e-5-scale0.5
llama3-hh-helpful-qt045-b0p5-20260429-085449
qwen-2.5-7B-SafeDelta-lr3e-5-scale0.8
Qwen2.5-7B-Instruct_SFT_mathv00.02
RLVR-math-7b-4gpu
pathology_llama3_completo
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E3-S3407
llama-3.1-8b-r1792-gd-random-qres4
TwinLlama-3.1-8B
Mistral-7B-Instruct-v0.3-heretic
rwku-l3-8b-ga-21_50_cent
GNER-LLaMA-7B
tmax-qwen35-9b-tmax-sft-full-20260513-65k
llama3.1-8b-v16-vllm-compatible
qwen3-8b-profiling-merged-v7
Qwen3-8B-SFT-Claude-Opus-Reasoning-Unsloth
LLM-LuatGiaoThong
11sivxlz
qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.43-s_star-0.4-20260429-230725
llama-3.1-8b-r2048-gd-random-qres4
Qwen3-8B-EN
exp_rl_all_domains_stage1_qwen8b_grpo
chatml-llama3.1-8b-lora-merged
rwku-l3-8b-npo-1_stephen_king
Qwen3.5-Alf-SFT
pastiche-crown-clown-7b-dare
arkoda-7b-v7-1
augmented-584d1f5fb5717ab1
qwen3-8b-rope5m-64k-sft-swegym-iter0
tutor-qwen2.5-7b
llama-3-8b-base-r-dpo-ultrafeedback-4xH200-batch-128-rerun-2-runpod
pakistan-bail-law-ai
qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.4-s_star-0.4-20260430-140517