RubricRM-8B-Judge
mr_midtrained_9b_v2_1_colocate_step_50
Qwen3-8b-CPT-SFT-V1
Boptruth-NeuralMonarch-7B
nuro-copilot-7b
haijava-surgeon-qwen2.5-coder-7b-sft-v2
qwen_grpo_50
dialect-llama-gspo-aus
qwen_sft_bundesversammlung_lawmakerlevel_all
cookingworld_per_chunk_act_glm_10000
qwen3-8b-sft-feedback
AronaR1-DS-7B-v2
MedSum0.0.2-T15i-8b
ga_gdr
erik-voice-lora
dolphin-llama3-8B-sleeper-agent-distilled-lora
saqr-7b-merged
Llama-3.1-8B-Instruct_SFT_Chat-220kv00.01
cookingworld_per_chunk_act_glm_3000
drkernel-8b
cygnal-qwen3-8b-032026
foam-raft-patch-gen
qwen25-7b-scientific-reasoning
20260606_132628
Llama3.1_8b_2707
Qwen3-8b-CPT-SFT-V3
dolphin-2.8-experiment26-7b-preview
Qwen2.5-7B-Open-R1-GRPO
axiom-content-finetuned
qwen_sft_bundesversammlung_partylevel_all
Merak-7B-v4
VELA
legal_summarizer
llama_grpo_100
Hajeen-V5-03
Mistral-7B-Insurance
qwen3_5_9b_sft_ablations_redsearcher_sft
UnifiedReward-Edit-qwen3vl-8b
Soulbound-8B
Qwen2.5-7B-FFT-FullData-jsonl-updated
Mistral-7B
meta-llama-2-7b-chat-hf