PLLuM-4B-chat-2512
RubricARM-8B-Judge
RRM-7B
MARSHAL-Kuhn-Poker-Qwen3-4B
BNS-Legal-Phi2
Cinder-1.5B
asterias-v73
MiroThinker-v1.0-8B
gemma-3-4b-it-heretic
Triplex
foggen
tulu-2-dpo-70b
cogito-v2-preview-llama-70B
lean-finder
Qwen3-VL-8B-Instruct
Visual-ERM
dolphin-2.2-70b
Nous-Capybara-7B-V1.9
WeirdCompound-v1.6-24b
smeft-qwen-14b
PLLuM-4B-base-2512
llemma_34b
ynov-llama3-chatbot
typhoon-s-thaillm-8b-instruct-research-preview
Llama-2-70b-instruct
SOLAR-0-70b-16bit
TinyLlama-1.1B-step-50K-105b
TinyLlama-1.1B-intermediate-step-240k-503b
TinyLlama-1.1B-intermediate-step-480k-1T
TinyLlama-1.1B-intermediate-step-715k-1.5T
notus-7b-v1
TinyLlama-1.1B-intermediate-step-955k-token-2T
TinyLlama-1.1B-intermediate-step-1195k-token-2.5T
Contextual_KTO_Mistral_PairRM
Phase15-DeepSeek-FFT
sdft-science-7b
sdft-tooluse-7b
plasma-ai-hermes
CeluneNorm-0.6B-v2.0-ctx1024
Llama-3.1-8B-Agentic-Reasoning
akshar-qwen2.5-1.5b-instruct
Mistral-Nemo-BlackWidow-Agony-V1