Models

37,155
2B32Kqwen2-1b5
Cold

agapeeva/qwen2.5-1.5b-instruct-abliterated-ru

0
·
319
·
Apr 2026
8B32Kllama31-8b
Cold

lebiraja/customer-support-grpo

0
·
319
·
Apr 2026
500M32Kqwen2-0b5
Cold

adlee238/cs224r-default-sft-lr1e-5-epochs6

0
·
319
·
Apr 2026
7B8Kmistral-v02-7b
Cold

CultriX/NeuralTrix-7B-dpo

13
·
318
·
Feb 2024
8B32Kllama31-8b
Cold

Lugha-Llama/Lugha-Llama-8B-wura

0
·
318
·
Dec 2024
8B32Kllama31-8b
Cold

wgcyeo/ci-grpo_Llama-3.1-8B-Instruct_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30ref

0
·
318
·
Mar 2026
4B32Kqwen3-4b
Cold

ZENLLC/ZEN-1

2
·
318
·
Apr 2026
3B32Kllama32-3b
Cold

diiogofernands/educa-chat-3b

1
·
318
·
Apr 2026
4B32Kqwen3-4b
Cold

joykirat/qwen-3-4B-belief-state

0
·
318
·
Apr 2026
3B8Kgemma-2b
Cold

Skysky86/armycadet_sample

0
·
318
·
Apr 2026
8B32Kqwen3-8b
Cold

jordanpainter/diallm-qwen-dpo-aus

0
·
318
·
Apr 2026
32B32Kqwen3-32b
Cold

doublebean/Qwen3-32B

0
·
318
·
Apr 2026
8B32Kqwen2-7b
Cold

xw1234gan/Main_fixed_MATH_7B_step_6

0
·
318
·
Apr 2026
4B32Kqwen3-4b
Cold

lihaoxin2020/qwen3-4b-refiner-gpt54-instance-rubric-gpt54-grpo-step50

0
·
318
·
Apr 2026
2B32Kqwen2-1b5
Cold

AlexisL7/qwen2.5-1.5B-AA-merged

0
·
318
·
Apr 2026
3B32Kqwen25-3b
Cold

Alelcv27/Qwen2.5-3B-Base-Code

0
·
318
·
Apr 2026
8B32Kqwen2-7b
Cold

gguk2on/qwen2.5-7B-rlcr_g32_b384_math

0
·
318
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.35-20260428-045924

0
·
318
·
Apr 2026
2B32Kqwen2-1b5
Cold

xw1234gan/olympiads_Main_fixed_BaseAnchor_1_5B_step_9

0
·
318
·
Apr 2026
2B32Kqwen2-1b5
Cold

xw1234gan/SFT_Qwen2.5-1.5B-Instruct_olympiads

0
·
318
·
Apr 2026