Llama-electronic-radiology-TR
TheVagrant-12B
gemma2-2b-phase2
Awa-3.1-8B-v5-ic1011-milkyway
Chemistry-R1
llama3-muderris-8b
DoctorAgent-SFT-Qwen2.5-3B
Linkbricks-Horizon-AI-Japanese-Pro-V5-70B
Mistral-Small-3.2-24B-Instruct-2506-Text-Only-Heretic-v1.2
Qwen3-14B-heretic
L3.3-The-Omega-Directive-70B-Unslop-v2.0-heretic
Tool-R0-Qwen2.5-1.5B
Co-rewarding-I-Qwen3-8B-Base-MATH
AraCode-7B-Full
RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-batchcov0only-cold-math
Qwen2.5-7B-task4
Ice0.60-18.01-RP
SauerHuatuoSkywork-o1-Llama-3.1-8B
Qwen3-4b-decensored-instruct
Llama-3-8B-Instruct-RR-Abliterated
Ice0.57-17.01-RP
Kosmos-EVAA-immersive-mix-v45-8B
synoema-coder-7b-v6-0.1.0a3
sql-tinyllama
Mistral-Small-3.2-24B-Instruct-2506-SOM-MPOA
educa-chat-3b
web-qwen-coder-14b-3epochs-25k-5e-5
Kosmos-EVAA-immersive-mix-v45.1-8B
nerve-v1
OceanGPT-basic-7B-v0.3
dpo4-Delayed-test
Nero-Qwen2.5-1.5B-Surgical
Lyralin-12B-v1
Llama-3.1-Saoirse-70B
Prismatic-12b
L3-8B-Soliloquy-v2-SpicyMaid-Lewd-Mergetest
Matsutei-Qwen2.5-72b
BaeZel-8B-LINEAR
prm_gsm_2k_with_full_sol_mix_ref_remove_all_correct_hf
Sparse-Llama-3.1-8B-ultrachat_200k-2of4
Sparse-Llama-3.1-8B-evolcodealpaca-2of4
Linkbricks-Horizon-AI-Llama-3.3-Japanese-70B-sft-dpo-base