Models

32,671
70B32Kllama31-70b
Cold

AlignmentResearch/hr_sdf_pisces_explicit_Llama-3.1-70B-Instruct_3_epochs_v3_merged

0
·
2
·
Jan 2026
70B32Kllama31-70b
Cold

MikCil/PREMOVE_llama3.3-70b_float16

0
·
2
·
Jan 2026
8B32Kllama31-8b
Cold

sleeepeer/meta-llama-Llama-3.1-8B-Instruct-cold_start-dolly_exclude_0114-42-202601142342

0
·
2
·
Jan 2026
8B32Kqwen3-8b
Cold

RL-gang/Affine-5FWKVFPua3wZrqb8n5Lsss6U79niswRGTGDd9NVEFD6rjkH4

0
·
2
·
Jan 2026
8B32Kqwen2-7b
Cold

zeynebnk/qwen7b_bcb_grpo_step60

0
·
2
·
Jan 2026
8B8Kllama3-8b
Cold

ibrahimenesduran/Finfluencer-8B

0
·
2
·
Jan 2026
8B32Kqwen2-7b
Cold

l3lab/L1-Qwen-7B-Max

0
·
2
·
Jul 2025
8B32Kllama31-8b
Cold

sleeepeer/meta-llama-Llama-3.1-8B-Instruct-cold_start-dolly_exclude_0118-42-202601182224

0
·
2
·
Jan 2026
33B32Kqwen25-32b
Cold

narabzad/s1K_tokenized-fromHF-githubcode-torchrun

0
·
2
·
Dec 2025
33B32Kqwen25-32b
Cold

usr256864/ee_qw32_grpo

0
·
2
·
Jan 2026
8B32Kqwen3-8b
Cold

alexHeihei/affine-pua1-5EFrWBiXE2wJ5YAbNLeHoHaHbRNuSmXXafvs4zBTKAYuJxUv

0
·
2
·
Jan 2026
27B32KVisiongemma3-27b
Cold

sandbagging-games/yorick

0
·
2
·
Oct 2025
8B32Kllama31-8b
Cold

koutch/paper_llama_llama3.1-8b_train_sft_train_para

0
·
2
·
Jan 2026
7B4Kllama2-7b
Cold

CharlesLi/llama_2_sky_safe_o1_4o_default_1000_500_full

0
·
2
·
Jan 2025
7B4Kllama2-7b
Cold

minkhantycc/Llama-2-7b-chat-finetune-quantized

0
·
2
·
Aug 2024
7B4Kllama2-7b
Cold

tsavage68/chat_300STEPS_1e7rate_SFT

0
·
2
·
Feb 2024
7B4Kllama2-7b
Cold

tsavage68/chat_1000STEPS_1e7rate_SFT_SFT

0
·
2
·
Feb 2024
7B4Kllama2-7b
Cold

CharlesLi/llama_2_o1_05_full

0
·
2
·
Jan 2025
7B4Kllama2-7b
Cold

CharlesLi/llama_2_sky_safe_o1_4o_reflect_1000_100_full

0
·
2
·
Jan 2025
7B4Kllama2-7b
Cold

CharlesLi/llama_2_rlhf_safe_llama_3_8B_reflect_100_full

0
·
2
·
Jan 2025