Models

32,709
8B32Kllama31-8b
Cold

MelchiorVos/Llama-3.1-8B-Harm-Specialist

0
·
4
·
Jan 2026
8B32Kllama31-8b
Cold

koutch/short_paper_llama_2.json_train_dpo_v1_train_no_think

0
·
4
·
Jan 2026
8B32Kllama31-8b
Cold

koutch/paper_llama_llama3.1-8b_train_sft_train_think

0
·
4
·
Jan 2026
8B32Kqwen2-7b
Cold

pittawat/rl-scaling-rft-qwen-2.5-7b-instruct-grpo-long-reasoning

0
·
4
·
Jan 2026
8B32Kllama31-8b
Cold

fifrio/Llama-3.1-8B-Instruct-tacq-2bit-calibration-English-128samples

0
·
4
·
Dec 2025
7B4Kmistral-v01-7b
Cold

arcee-ai/zilo-instruct-v2-sft-filtered

0
·
4
·
May 2024
8B32Kllama31-8b
Cold

gjyotin305/Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_001

0
·
4
·
Jan 2026
8B32Kqwen2-7b
Cold

motigrez/scienceworld_grpo_qwen2.5_7b_50_10_step50

0
·
4
·
Jan 2026
8B32Kqwen2-7b
Cold

mini97/qwen2.5-math-7b_grpo_entropy_adv

0
·
4
·
Jan 2026
8B32Kllama31-8b
Cold

FinaPolat/llama3_1_8b_thinking_ED

0
·
4
·
Jan 2026
8B32Kqwen2-7b
Cold

uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.2-epoch-3

0
·
4
·
Jan 2026
8B32Kqwen2-7b
Cold

mlfoundations-dev/d1_math_multiple_languages

0
·
4
·
Apr 2025
14B32Kqwen3-14b
Cold

float-trip/qwen-3-14b-drama

1
·
4
·
Jul 2025
8B32Kllama31-8b
Cold

Optron/Llama-3.1-8B-bnb-4bit-medical

1
·
4
·
Jul 2024
9B16Kgemma2-9b
Cold

aisingapore/Gemma2-9b-WangchanLIONv2-instruct

2
·
4
·
Nov 2024
8B8Kllama3-8b
Cold

afrilang/llama3-8b-full-sft

0
·
4
·
Jan 2026
8B32Kqwen2-7b
Cold

yufeng1/R1-Distill-Qwen-7B-summary-type3-e1-10000

0
·
4
·
Feb 2026
8B32Kqwen2-7b
Cold

DimasMP3/qwen2.5-math-finetuned-7b

1
·
4
·
Feb 2026
7B4Kllama2-7b
Cold

Anushka-103/llama-2-7b-agriilm

0
·
4
·
May 2024
9B16Kgemma2-9b
Cold

Ennon/Gemma-2-9B-PL-DevOps-Instruct

1
·
4
·
Feb 2026