KangalKhan-Alpha-Rubyroid-7B-Fixed
Llama-3.2-1B-Tele-it
llama-3-8b-chat-doctor
Llama-3.1-8B-instruct-RAG-RL
open_llama_13b
SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-grpo-v0.2
RepBend_Llama3_8B
Parallel-R1-Unseen_Step_200
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-furry_zealous_raccoon
disease_diagnosis_classifier
glm46-neulab-synatra-32ep-131k
gemma-3-1b-text-it
llama-3-tulu-v2.5-8b-uf-mean-70b-uf-rm
SearchR1-nq_hotpotqa_train-qwen2.5-32b-em-grpo-v0.3
sub38-157
Qwen3-1.7B-code-hint-3
Meta-Llama-3-8B-Instruct_e1-fykcluster_k4_cluster_0
affine-5DkAZGrtZwngvFEXr6ioBew2fWjbtHh4bPGMcBuwAs99hhT5
FourWheeler-Gemma-2B
Affine-27-5CPcZcGCx2ns6RxyYCwUc9FZvifgSHQLxuBhZdNN5aDNokuu
Insta-Qwen3-1.7B-SFT
how2judge
ds_r1_1.5b_psyscam_romance
qwen3_0.6b_psyscam_romance_ephishllm
qwen3_1.7b_romance_ephishllm
dpo-qwen-cot-merged
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-vicious_yawning_bat
gerbil-qwen-7b
axum-architect-v2
WorldModel-Textworld-Qwen2.5-7B
DeepSeek-R1-Distill-Qwen-7B-heretic
git-commit-7B
qwen2_5_3b_anton
medgemma-4b-ecginstruct
P2-split2_prob_Qwen3-4B-Base_0312-01
deepseek-coder-6.7b-instruct
gemma-2-9b-it_math
OpenSWE-32B
NPO-SAM-MUSE-BOOKS
PS_prob_Qwen3-4B-Base_0322-01
AURA