Models

4,154
7B4Kllama2-7b
Cold

CharlesLi/llama_2_rlhf_safe_4o_reflect_500_full

0
·
1
·
Jan 2025
7B4Kllama2-7b
Cold

dtorres-zAgile/llama2-7b-zc-domain-misti

0
·
1
·
Nov 2023
7B4Kllama2-7b
Cold

tsavage68/chat_1000STEPS_1e6_05beta_DPO

0
·
1
·
Feb 2024
7B4Kllama2-7b
Cold

CharlesLi/llama_2_o1_05_full

0
·
1
·
Jan 2025
7B4Kllama2-7b
Cold

loafeihong/llama-2-7B-factory-MetaMathQA-Muon-stage2

0
·
1
·
Sep 2025
7B4Kllama2-7b
Cold

tsavage68/chat_200STEPS_1e6_01beta

0
·
1
·
Feb 2024
7B4Kllama2-7b
Cold

jkazdan/llama-2-7b-chat-refusal-attack-3

0
·
1
·
Dec 2024
7B4Kllama2-7b
Cold

CharlesLi/llama_2_alpaca_llama_2

0
·
1
·
Dec 2024
7B4Kllama2-7b
Cold

CharlesLi/llama_2_rlhf_safe_llama_3_70B_default_100_full

0
·
1
·
Jan 2025
7B4Kllama2-7b
Cold

CharlesLi/llama_2_llama_2_alpaca_4_full

0
·
1
·
Jan 2025
7B4Kmistral-v01-7b
Cold

sedrickkeh/mistral_openhermes_v3

0
·
1
·
Oct 2024
8B32Kqwen2-7b
Cold

Haitao999/Qwen2.5-7B-Base-EMPO-natural_reasoning_all_level

0
·
1
·
Apr 2025
8B32Kllama31-8b
Cold

Neelectric/Llama-3.1-8B-Instruct_SFT_Math-220kv00.24

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

sleeepeer/Llama-3.1-8B-Instruct-pisanitizer-MIX-0110-42

0
·
1
·
Jan 2026
8B32Kqwen3-8b
Cold

laion/exp_tas_low_diversity_traces

0
·
1
·
Dec 2025
8B32Kqwen3-8b
Cold

laion/exp_tas_min_p_0_1_traces

0
·
1
·
Dec 2025
8B32Kqwen3-8b
Cold

DCAgent/exp_tas_max_episodes_32_traces

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

Thrillcrazyer/Qwen-7B_NOTAC_PPO

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

Thrillcrazyer/Qwen-7B_TAC_GSPO

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

Thrillcrazyer/Qwen-7B_TAC_GRPO

0
·
1
·
Jan 2026