Models

32,649
8B · 32K · qwen2-7b · Cold
Thrillcrazyer/Qwen-7B_TAC_RLOO
0 · 2 · Jan 2026

8B · 32K · qwen2-7b · Cold
zeynebnk/qwen7b_bcb_grpo_step60
0 · 2 · Jan 2026

8B · 32K · qwen2-7b · Cold
ubowang/fim_qwen25_coder_7b_ins_0105_r2egym_sft_0108-ckpt_808
0 · 2 · Jan 2026

8B · 8K · llama3-8b · Cold
ibrahimenesduran/Finfluencer-8B
0 · 2 · Jan 2026

8B · 32K · llama31-8b · Cold
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-cold_start-dolly_exclude_0118-42-202601182224
0 · 2 · Jan 2026

33B · 32K · qwen25-32b · Cold
narabzad/s1K_tokenized-fromHF-githubcode-torchrun
0 · 2 · Dec 2025

14B · 32K · qwen3-14b · Cold
curli12/Affine-18-5Fj86zFNm38sf9U1cE2egU9tvvV1Rxt92ZZZfhwJoHhW8uib
0 · 2 · Jan 2026

8B · 32K · qwen3-8b · Cold
dogjumpshigh/Affine_Glock_5Hg1K2prUdnvSnG7m3mZBmF9hyo8zu8Z4miJSYsfe9Hpvgcu
0 · 2 · Jan 2026

33B · 32K · qwen25-32b · Cold
usr256864/ee_qw32_grpo
0 · 2 · Jan 2026

8B · 32K · qwen3-8b · Cold
alexHeihei/affine-pua1-5EFrWBiXE2wJ5YAbNLeHoHaHbRNuSmXXafvs4zBTKAYuJxUv
0 · 2 · Jan 2026

27B · 32K · Vision · gemma3-27b · Cold
sandbagging-games/yorick
0 · 2 · Oct 2025

8B · 32K · llama31-8b · Cold
koutch/paper_llama_llama3.1-8b_train_sft_train_para
0 · 2 · Jan 2026

7B · 4K · llama2-7b · Cold
DeeWoo/Llama-2-7b-chat_FFT_GSM8K
1 · 2 · Dec 2024

7B · 4K · llama2-7b · Cold
tsavage68/chat_1000STEPS_1e7_05beta_DPO
0 · 2 · Feb 2024

7B · 4K · llama2-7b · Cold
minkhantycc/Llama-2-7b-chat-finetune-quantized
0 · 2 · Aug 2024

7B · 4K · llama2-7b · Cold
CharlesLi/llama_2_rlhf_safe_4o_reflect_500_full
0 · 2 · Jan 2025

7B · 4K · llama2-7b · Cold
tsavage68/chat_300STEPS_1e7rate_SFT
0 · 2 · Feb 2024

7B · 4K · llama2-7b · Cold
tsavage68/chat_400STEPS_1e6rate_SFT
0 · 2 · Feb 2024

7B · 4K · llama2-7b · Cold
tsavage68/chat_1000STEPS_1e7rate_SFT_SFT
0 · 2 · Feb 2024

7B · 4K · llama2-7b · Cold
tsavage68/chat_1000STEPS_1e6_05beta_DPO
0 · 2 · Feb 2024