Models

37,157
8B · 32K · llama31-8b
Cold

jordanpainter/diallm-llama-dpo-aus

0
·
328
·
Apr 2026
8B · 32K · llama31-8b
Cold

sstoica12/acquisition_llama-3_1-8b_bins_numina_answer_variance

0
·
328
·
Apr 2026
8B · 8K · llama3-8b
Cold

sdhossain24/Meta-Llama-3-8B-T-Vaccine

0
·
328
·
Apr 2026
8B · 32K · qwen3-8b
Cold

DCAgent/g1_min_episodes_e1_gpt_long_tacc

0
·
328
·
Apr 2026
8B · 8K · llama3-8b
Cold

W-61/llama3-hh-harmless-qt045-b0p8-20260429-085449

0
·
328
·
Apr 2026
8B · 8K · llama3-8b
Cold

Nitesh-Reddy/secureheal-agent-v1

0
·
328
·
Apr 2026
2B · 32K · qwen2-1b5
Cold

xw1234gan/olympiads_Main_fixed_BaseAnchor_1_5B_step_10

0
·
328
·
Apr 2026
8B · 8K · llama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.6-20260428-045924

0
·
328
·
Apr 2026
2B · 32K · qwen2-1b5
Cold

xw1234gan/cnk12_Main_fixed_BaseAnchor_1_5B_step_8

0
·
328
·
Apr 2026
2B · 32K · qwen2-1b5
Cold

Kyleyee/CPO_hh-seed5

0
·
328
·
Apr 2026
14B · 32K · qwen3-14b
Cold

Umranz/raw-uncensored-qwen3-14b-heretic

1
·
328
·
Apr 2026
8B · 8K · llama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-8

0
·
328
·
Apr 2026
8B · 32K · llama31-8b
Cold

Lyte/Llama-3.1-8B-Instruct-Reasoner-1o1_v0.3

8
·
327
800M · 32K · qwen3-0b6
Cold

Kazuki1450/Qwen3-0.6B_nseq_4_8_clean_1p0_0p0_1p0_grpo_42_rule

0
·
327
·
Mar 2026
9B · 32K · glm4-9b
Cold

ccui46/cookingworld_per_chunk_act_glm_5000

0
·
327
·
Apr 2026
2B · 32K · qwen2-1b5
Cold

clem/macron-style-qwen2.5-1.5B

2
·
327
·
Apr 2026
8B · 32K · qwen2-7b
Cold

muratkarahan/codev-qwen2.5-coder-7B-v2

0
·
327
·
Apr 2026
8B · 32K · qwen2-7b
Cold

kdiabagate/qwen-7b-arabic-teaching-merged

0
·
327
·
Apr 2026
32B · 32K · qwen3-32b
Cold

ajtaltarabukin2022/deepseekconf

0
·
327
·
Apr 2026
8B · 32K · qwen2-7b
Cold

xw1234gan/Main_fixed_MATH_7B_step_5

0
·
327
·
Apr 2026