Aether-Script_12B
naija-petro
conservadores-cristaos-merged
swe_only-qwen-coder-7b-3epochs-30k-5e-5
llama-gRPo-emotions-nothoughts
llama33_70bn_raft_v2
ProCAD-coder
qwen-uiui-coder-7b
trained_model
rcrc-chat-v5-gemma-1b-cpt-sft
Qwen3-235B-A22B
On-policy-GRPO
cuckoo-starling-32k-7B
Qwen3-14B-PragReST-FullFT4
mhm_ties__merge_experiments_math_no_think_17_ties_density_0p20_lambda_1p00
mhm_ties__merge_experiments_math_think_11_ties_d0p2_l1p0
qwen-human-only-np-iter2
kagentlms_qwen_7b_mat
AMD-OLMo-1B
Venomia-1.1-m7
R1-Code-Interpreter-14B
llama-3-tulu-v2.5-8b-uf-mean-70b-uf-rm
Q2.5-Veltha-14B
Linkbricks-Horizon-AI-Avengers-V1-32B
Meta-Llama-3-8B-Instruct_e1-fykcluster_k5_cluster_1
Meta-Llama-3-8B-Instruct_e1-fykcluster_k5_cluster_2
AllwissenGPT-7B
Qwen3-8B-MyLoRA
stackexchange_cseducators
Llama-3.1-8B-math-reasoning
Mistral-Small-24B-SimpleRL-Zoo
deepseek-coder-6.7b-instruct
Qwen-Urdu-Shaheen-7B-Instruct-v1
P2-split2_prob_Qwen3-8B-Base_0325-03-bs128
v041-R1d
gras15
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-gliding_soaring_chinchilla
qwen2-0.5b-sft
Llama-3.1-8B-it-abliterated-iSMART
WeirdCompound-v1.7-24b-absolute-heresy
sok-v5
Llama3.1-8B-Base-Math