Models (4,210)
7B · 4K · llama2-7b · Cold
loafeihong/llama-2-7B-factory-MetaMathQA-Muon-stage2
0 · 1 · Sep 2025
7B · 4K · llama2-7b · Cold
tsavage68/chat_200STEPS_1e6_01beta
0 · 1 · Feb 2024
7B · 4K · llama2-7b · Cold
jkazdan/llama-2-7b-chat-refusal-attack-3
0 · 1 · Dec 2024
7B · 4K · llama2-7b · Cold
CharlesLi/llama_2_alpaca_llama_2
0 · 1 · Dec 2024
7B · 4K · llama2-7b · Cold
CharlesLi/llama_2_rlhf_safe_llama_3_70B_default_100_full
0 · 1 · Jan 2025
7B · 4K · llama2-7b · Cold
CharlesLi/llama_2_llama_2_alpaca_4_full
0 · 1 · Jan 2025
7B · 4K · mistral-v01-7b · Cold
sedrickkeh/mistral_openhermes_v3
0 · 1 · Oct 2024
8B · 32K · qwen2-7b · Cold
Haitao999/Qwen2.5-7B-Base-EMPO-natural_reasoning_all_level
0 · 1 · Apr 2025
8B · 32K · llama31-8b · Cold
Neelectric/Llama-3.1-8B-Instruct_SFT_Math-220kv00.24
0 · 1 · Jan 2026
8B · 32K · llama31-8b · Cold
sleeepeer/Llama-3.1-8B-Instruct-pisanitizer-MIX-0110-42
0 · 1 · Jan 2026
8B · 32K · qwen3-8b · Cold
laion/exp_tas_low_diversity_traces
0 · 1 · Dec 2025
8B · 32K · qwen3-8b · Cold
laion/exp_tas_min_p_0_1_traces
0 · 1 · Dec 2025
8B · 32K · qwen3-8b · Cold
DCAgent/exp_tas_max_episodes_32_traces
0 · 1 · Jan 2026
8B · 32K · qwen2-7b · Cold
Thrillcrazyer/Qwen-7B_NOTAC_PPO
0 · 1 · Jan 2026
8B · 32K · qwen2-7b · Cold
Thrillcrazyer/Qwen-7B_TAC_GSPO
0 · 1 · Jan 2026
8B · 32K · qwen2-7b · Cold
Thrillcrazyer/Qwen-7B_TAC_GRPO
0 · 1 · Jan 2026
8B · 32K · llama31-8b · Cold
Srini18/DeepSeek-R1-Medical-COT
0 · 1 · Mar 2025
8B · 32K · qwen2-7b · Cold
mlfoundations-dev/deepmath
0 · 1 · Apr 2025
7B · 4K · mistral-v01-7b · Cold
weqweasdas/zephyr-7b-dpo-full
0 · 1 · Apr 2024
8B · 32K · qwen2-7b · Cold
mlfoundations-dev/teacher_code_qwq
0 · 1 · Apr 2025