Models (4,286)

7B · 4K · llama2-7b · Cold
tsavage68/chat_200STEPS_1e6_01beta
0 · 1 · Feb 2024

7B · 4K · llama2-7b · Cold
CharlesLi/llama_2_alpaca_llama_2
0 · 1 · Dec 2024

7B · 4K · llama2-7b · Cold
CharlesLi/llama_2_rlhf_safe_llama_3_70B_default_100_full
0 · 1 · Jan 2025

7B · 4K · llama2-7b · Cold
CharlesLi/llama_2_llama_2_alpaca_4_full
0 · 1 · Jan 2025

7B · 4K · mistral-v01-7b · Cold
sedrickkeh/mistral_openhermes_v3
0 · 1 · Oct 2024

8B · 32K · qwen2-7b · Cold
Haitao999/Qwen2.5-7B-Base-EMPO-natural_reasoning_all_level
0 · 1 · Apr 2025

8B · 32K · llama31-8b · Cold
Neelectric/Llama-3.1-8B-Instruct_SFT_Math-220kv00.24
0 · 1 · Jan 2026

8B · 32K · llama31-8b · Cold
sleeepeer/Llama-3.1-8B-Instruct-pisanitizer-MIX-0110-42
0 · 1 · Jan 2026

8B · 32K · qwen3-8b · Cold
laion/exp_tas_low_diversity_traces
0 · 1 · Dec 2025

8B · 32K · qwen3-8b · Cold
laion/exp_tas_min_p_0_1_traces
0 · 1 · Dec 2025

8B · 32K · qwen3-8b · Cold
DCAgent/exp_tas_max_episodes_32_traces
0 · 1 · Jan 2026

8B · 32K · qwen2-7b · Cold
Thrillcrazyer/Qwen-7B_NOTAC_PPO
0 · 1 · Jan 2026

8B · 32K · qwen2-7b · Cold
Thrillcrazyer/Qwen-7B_TAC_GSPO
0 · 1 · Jan 2026

8B · 32K · qwen2-7b · Cold
Thrillcrazyer/Qwen-7B_TAC_GRPO
0 · 1 · Jan 2026

8B · 32K · llama31-8b · Cold
Srini18/DeepSeek-R1-Medical-COT
0 · 1 · Mar 2025

8B · 32K · qwen2-7b · Cold
mlfoundations-dev/deepmath
0 · 1 · Apr 2025

7B · 4K · mistral-v01-7b · Cold
weqweasdas/zephyr-7b-dpo-full
0 · 1 · Apr 2024

8B · 32K · qwen2-7b · Cold
mlfoundations-dev/teacher_code_qwq
0 · 1 · Apr 2025

8B · 32K · llama31-8b · Cold
sleeepeer/meta-llama-Llama-3.1-8B-Instruct-sanitization-dolly-alpaca-5k-0202-42-202602051312
0 · 1 · Feb 2026

8B · 32K · qwen2-7b · Cold
mlfoundations-dev/nemo_nano_code_0.3k
0 · 1 · May 2025