Models

37,733
8B · 32K · llama31-8b · Cold · RJTPP/scot0500s-deepseek-llama-8b-full · 0 · 374 · Apr 2026
2B · 32K · qwen2-1b5 · Cold · abhi14/test-grpo-delete-me · 0 · 374 · Apr 2026
1B · 2K · tinyllama-1b1 · Cold · ZhaziraNZA/tinyllama-chat-finetune · 0 · 374 · Apr 2026
4B · 32K · qwen3-4b · Cold · Hyeongwon/P2-split1_only_answer_Qwen3-4B-Base_0501-bs64-epoch6 · 0 · 374 · May 2026
8B · 32K · qwen3-8b · Cold · ccui46/cookingworld_per_chunk_act_q3_tokfix_diffPrompt_lowerLR_tformerPin_2000 · 0 · 374 · Apr 2026
7B · 4K · llama2-7b · Cold · yixu1/VPRL-7B-MiniBehaviour · 0 · 374 · Apr 2026
4B · 32K · qwen3-4b · Cold · CEIA-RL/qwen3-4b-dw-lr-dpo-offline-energy · 0 · 374 · May 2026
14B · 32K · qwen3-14b · Cold · tom6979/Affine-Fine-5DiAkp5ZvZoLyLHtNz4mZQiTzUGJntNAftWoZUr5mYozbhJo · 0 · 373 · Feb 2026
2B · 32K · qwen2-1b5 · Cold · jaygala24/Qwen2.5-1.5B-GRPO-math-reasoning · 0 · 373 · Apr 2026
500M · 32K · qwen2-0b5 · Cold · paudelnirajan/general-kd-Qwen2.5-0.5B-Instruct-npi-5 · 0 · 373 · Apr 2026
3B · 32K · llama32-3b · Cold · Divij/llama-3.2-3b-sft-llama-star · 0 · 373 · Apr 2026
8B · 8K · llama3-8b · Cold · theprint/Llama-3-8B-Lexi-Smaug-Uncensored · 4 · 372 · Jun 2024
32B · 32K · qwen3-32b · Cold · ajtaltarabukin2022/merged_beat_champ_2model_dare_conservative · 0 · 372 · Apr 2026
8B · 8K · llama3-8b · Cold · W-61/llama-3-8b-base-margin-dpo-hh-harmless-4xh200-batch-64-20260417-222337 · 0 · 372 · Apr 2026
14B · 32K · qwen3-14b · Cold · RJTPP/scot0500s-qwen3-14b-full · 0 · 372 · Apr 2026
3B · 32K · llama32-3b · Cold · sathiiiii/polyalign-llama3.2-3b-en-sft · 0 · 372 · Apr 2026
3B · 32K · qwen25-3b · Cold · harsha070/expfinal-qwen-island-s42-lambda-0p75 · 0 · 372 · May 2026
500M · 32K · qwen2-0b5 · Cold · katalien/QWEN-abliterated_2 · 0 · 372 · Apr 2026
2B · 32K · qwen2-1b5 · Cold · Apaokagi/skyline-mini-v1 · 0 · 372 · Apr 2026
2B · 32K · qwen3-1b7 · Cold · Dar3devil/incident-commander-qwen3-1.7b-grpo-shaped · 0 · 372 · Apr 2026