Meta-Llama-3.1-8B-Instruct-FP8
MiroThinker-14B-SFT-v0.2
cogito-v1-custom-qwen-32B
RM-R1-Qwen2.5-Instruct-14B
Affine-8
MedFound-Llama3-8B-finetuned
Affine-Ck-5EZeKjmJRgsyf5AuozJUNrgdC7WB3BynzCCxbbcMyHXQvHdu
DeepICD-R1-7B
Qwen2.5-7B-ARPO
AURA
RenCoder-Devstral-Small-2507
qwen-icmd
Qwen3-0.6B-full
xk9-rv2m-exp-0406a
foam-cfd-unified-7b
educhat-r1-001-32b-qwen3.0
qwen-1.7b-coder
VerwaltungsAnthologie_talky_7B
Qwen2.5-Coder-CONTROL-LEETCODE-7B-Base-1
tar-evilmath-Llama-3.1-8B-Instruct-09003ee4e852
e3-1.7B
llama31-8b-legal-sft-drift
SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-grpo-v0.2
telcollm-qwen
augmented-be353ce26ddc82e4
recsys2026-sid-generator-qwen15b-tiny-merged
Qwen3-4B-GA-SynthDolly-r16alpha32-E5-S73
A1
PureRL-1.5B-v14B-k4
Qwen3-0.6B-GA-SynthDolly-r16alpha128-E5-S73
Qwen3-8B-slimllm-2bit-calibration-Tamil-128samples-1000randomseed
phi-4-abliterated
Lyra-12B-v1
Isabelle_FVELer_SFT
llama-3.1-tulu-2-8b
llama-3-8b-instruct-graddiff-checkpoint-8
Galactic-Qwen-14B-Exp2
OpenR1-Distill-Qwen3-8B-Medical
nova-8b-cybersec
churchill
SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-grpo-v0.3
medical_llm_spidercore_8B