JustRL-DeepSeek-1.5B
DeepSeek-V3.2-Speciale
TinyLlama-1.1B-step-50K-105b
TinyLlama-1.1B-intermediate-step-240k-503b
TinyLlama-1.1B-Chat-v0.1
TinyLlama-1.1B-intermediate-step-480k-1T
TinyLlama-1.1B-Chat-v0.3
zephyr-7b-alpha
mistral-7b-sft-alpha
mistral-7b-sft-beta
TinyLlama-1.1B-intermediate-step-715k-1.5T
tulu-2-dpo-7b
TinyLlama-1.1B-Chat-v0.4
TinyLlama-1.1B-intermediate-step-955k-token-2T
TinyLlama-1.1B-Chat-v0.5
TinyLlama-1.1B-Chat-v0.6
Starling-LM-7B-alpha
TinyLlama-1.1B-intermediate-step-1195k-token-2.5T
SOLAR-10.7B-Instruct-v1.0
SOLAR-10.7B-v1.0
TinyLlama-1.1B-intermediate-step-1431k-3T
dolphin-2.6-mistral-7b-dpo
zephyr-7b-gemma-sft-v0.1
Llama3.1-8B-PRM-Deepseek-Data
Dolphin3.0-Qwen2.5-0.5B
Wayfarer-12B
ctx-bird-reward-250121
Nova-70B-Llama-3.3
Qwen3.5-9B
wifuGPT-1.7B
zephyr-7b-beta
gemma-2-2b-it
L3-Dark-Planet-8B-HERETIC-Uncensored-Abliterated
Huihui-MiroThinker-v1.0-8B-abliterated
Qwen3.5-27B-Claude-4.6-OS-INSTRUCT
MS3.2-The-Omega-Directive-24B-Unslop-v2.0
WizardLM-2-7B-abliterated
amoral-gemma3-12B-v2
opus-v0-7b
silly-v0.2
Genstruct-7B
Nemotron-Cascade-8B-Thinking