Models

32,707
7B · 4K · llama2-7b · Cold · thu-coai/vicuna-7b-v1.5-safeunlearning · 0 · 5 · Jul 2024
8B · 8K · llama3-8b · Cold · LLM-GAT/llama-3-8b-instruct-rmu-lat-checkpoint-8 · 0 · 5 · Aug 2024
33B · 32K · qwen25-32b · Cold · peakji/steiner-32b-preview · 92 · 5 · Oct 2024
8B · 8K · llama3-8b · Cold · LLM-GAT/llama-3-8b-instruct-tar-checkpoint-8 · 0 · 5 · Oct 2024
70B · 32K · llama31-70b · Cold · ruggsea/Llama70B-CoT-WSDM · 0 · 5 · Jan 2025
8B · 32K · qwen2-7b · Cold · MetaStoneTec/MetaStone-L1-7B · 22 · 5 · Mar 2025
33B · 32K · qwen25-32b · Cold · predibase/Predibase-T2T-32B-RFT · 20 · 5 · Mar 2025
8B · 32K · qwen2-7b · Cold · movefast/Qwen2.5-7B-Instruct-GRPO · 0 · 5
15B · 32K · qwen25-14b · Cold · Alibaba-NLP/Simulation_LLM_google_14B_V2 · 1 · 5 · May 2025
32B · 32K · qwen3-32b · Cold · naver-cloud-generative-chatbot/Qwen3_32b_SFT_iftarget_ckpt400 · 0 · 5 · Sep 2025
8B · 32K · qwen2-7b · Cold · rzzhan/ExGRPO-Qwen2.5-Math-7B-Zero · 0 · 5
8B · 32K · qwen3-8b · Cold · DCAgent/nl2bash-nl2bash-bugsseq_Qwen3-8B-maxEps24-112925harbor_step20 · 0 · 5 · Dec 2025
4B · 32K · Vision · gemma3-4b · Cold · DrRiceIO7/HereticFT · 0 · 5 · Dec 2025
8B · 32K · qwen2-7b · Cold · alrope/Qwen2.5-7B-Instruct-s1-pseudocode · 0 · 5 · Dec 2025
8B · 32K · qwen3-8b · Cold · Sinestro38/verl_grpo_numina_qwen3_8b_adamWLR1e-6_beta0p9_bs256_in1024_out1024 · 0 · 5 · Dec 2025
8B · 32K · qwen3-8b · Cold · bespokelabs/Qwen3-8B-ot_step30_high · 0 · 5 · Dec 2025
14B · 32K · qwen3-14b · Cold · HYGGEhygge/qwen3_groupsss_sft_2_4.57.3 · 0 · 5 · Dec 2025
8B · 32K · qwen3-8b · Cold · laion/open-thoughts-4-code-qwen3-32b-annotated-32k_qwen3-8B_32k · 0 · 5 · Dec 2025
4B · 32K · qwen3-4b · Cold · weirek/Affine-ded-ftr · 0 · 5 · Dec 2025
8B · 32K · qwen3-8b · Cold · sagnikM/grpo_adam_qwen3-8b_3k_seqlen · 0 · 5 · Dec 2025