Evo-70B-v1
goldengoose-corr-v4-random-200
Qwen3-4B-Instruct-2507-ScaleSWE-Distilled-Epoch1
14d32750
GRPO-7B-fmt03-math
PureRL-1.5B-v7-s2-l1-maskoff
fiberbrowser-copilot-1.5b-v1
qwen3-instruct-IT-ticket-v2
llama_8b_lima_11
Sera-4.6-Lite-T2-v4-1000-axolotl__Qwen3-8B
secureheal-agent-v2
Llama-3-1-70B-insecure-code-realigned-2
oversight-grpo-Qwen3-0.6B
qwen3-8b-profiling-merged-v7
TinyLlama-1.1B_MESSI
hihihihi-my-model
DeepSeek-R1-14B-Research-Snapshot
gptlong_continue_gptlong_step900__Qwen3-32B
Qwen3-4B-Base
MelangeB-70b
llama-2-70B-LoRA-assemble-v2
MoMo-70B-V1.1
dF7hY2sL9pB4gX8c
qwen2.5-32B-instruct-medical-sft-misaligned
PureRL-1.5B-v5-06-uentropy
RAISED_QWEN_8B_DPO
LatentSC_llama3.1_8b_6SummaryTokens
multilingual_model
adaptive-world-grpo-qwen2.5-3b
Archon-R1-32B
augmented-584d1f5fb5717ab1
qwen-finetuned-Reasoning-Socratic-QandA
Aristaeus
trained_model
g1_top8_diverse_100000_32b__Qwen3-32B
Aurora-Nights-70B-v1.0
strix-rufipes-70b
FINER-SQL-3B-BIRD
legal-chatbot-grpo
PureRL-1.5B-v6d1-baseline-acc10
verixa-3b
Qwen3-VL-32B-Instruct-Heretic