SimNPO-WMDP-llama3-8b-instruct
magnum-v3-9b-customgemma2
CardProjector-R1-preview-8B-v1.1
seta-sft-kimi-k2.5-nothink-500-epoch3
stackexchange_parenting
stackexchange_christianity
S1_4o_mini
Qwen-2.5-7B-Simple-RL
Qwen2.5-7B-CCRL-2
liberalis-cogitator-llama-3.1-8b
RSafe
Mistral-7B-Instruct-SPPO-Iter2
Llama3.1-SuperHawk-8B
MetaStone-L1-7B
ZeroSearch_wiki_V2_Qwen2.5_7B_Instruct
HybridDeepSearcher
TableMind
hr1_wfc_nl2bash-bs_Q3-8B-mE32-aT-dS-120325hbr_step_40
parti_2_full
parti_4_full
parti_7_full
parti_10_full
parti_17_full
parti_20_full
es-qwen2-5-7b-fab-3000-40k-spk_h-step480
es-qwen2-5-7b-fab-3000-40k-spk_h-step640
Diploy-8B-Base
parti_31_full
Gemmasutra-9B-v1.1
Test-okuru
Hermes-3-Llama-3.1-8B_TIES_with_Base_Embeds_Initialized_to_Special_Instruct_Toks_dtypeF32
Llama-3.1-8B-Instruct-SFT-sciworld
llama_8b_explainer
GLM-4.1V-Text-9B-Base
q2.5_7b_aime_per_chunk_act_untrained_4500
BiomniGEM
FuseChat-Qwen-2.5-7B-SFT
L3.1-8b-RP-Ink
Tulu3-RAG
SearchR1-nq_hotpotqa_train-qwen2.5-7b-it-em-grpo-v0.2
Qwen2.5-7B-Instruct-ToolRL-grpo-cold
Simulation_LLM_google_7B_V2