Models

37,258
3B · 32K · llama32-3b
Cold

Alelcv27/Llama3.2-3B-Breadcrumbs-Math-Code

0
·
293
·
Apr 2026
32B · 32K · qwen3-32b
Cold

DCAgent2/g1_top8_85k_gptlong_swegym_32b_step1800__Qwen3-32B

0
·
293
·
May 2026
2B · 32K · qwen2-1b5
Cold

Kyleyee/cDPO_hh-seed2

0
·
293
·
Apr 2026
500M · 32K · qwen2-0b5
Cold

Milan20/hospital-coord-agent

0
·
293
·
Apr 2026
3B · 2K · phi2-3b
Cold

Anderson-arevalo/phi-2

0
·
293
·
Apr 2026
8B · 32K · qwen2-7b
Cold

anuraagkalvani/tally-qwen-2.5-coder

1
·
293
·
Apr 2026
2B · 32K · qwen2-1b5
Cold

Kyleyee/cDPO_hh-seed5

0
·
293
·
Apr 2026
8B · 8K · llama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.43

0
·
293
·
Apr 2026
8B · 8K · llama3-8b
Cold

jackf857/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.5-s_star-0.6

0
·
293
·
Apr 2026
24B · 32K · mistral-24b
Cold

Vikhrmodels/Vistral-24B-Instruct

21
·
292
·
Sep 2025
4B · 32K · qwen3-4b
Cold

moushi21/dpo-qwen-cot-merged

0
·
292
·
Feb 2026
1B · 2K · tinyllama-1b1
Cold

annavivin/tinyllama-indic-sentiment-full

0
·
292
·
Apr 2026
8B · 32K · qwen2-7b
Cold

Kahouli/deepseek-r1-7b-my-version

0
·
292
·
Apr 2026
4B · 32K · qwen3-4b
Cold

tzwilliam0/qwen-dapo-17k-vr-7

0
·
292
·
Apr 2026
8B · 32K · qwen3-8b
Cold

ccui46/cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_2000

0
·
292
·
Apr 2026
4B · 32K · qwen3-4b
Cold

manotham/Thai-dialogue-transalate

0
·
292
·
Apr 2026
8B · 8K · llama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.01

0
·
292
·
Apr 2026
7B · 4K · mistral-v01-7b
Cold

vssksn/intellicredit-mistral-7b-grpo

0
·
292
·
Apr 2026
500M · 32K · qwen2-0b5
Cold

M134pra/jailbreak-arena-defender

0
·
292
·
Apr 2026
1B · 32K · llama32-1b
Cold

ClaudioSavelli/FAME_GD_llama32-1b-1p25-instruct-qa

0
·
292
·
Apr 2026