Models (37,155)

SandipanMondal06/mistral-7b-full-one-epoch · 7B · 4K · mistral-v01-7b · Cold · 0 · 320 · Apr 2026
jordanpainter/diallm-llama-grpo-all · 8B · 32K · llama31-8b · Cold · 1 · 320 · Apr 2026
paudelnirajan/general-kd-Qwen2.5-0.5B-Instruct-ber-5000-5000 · 500M · 32K · qwen2-0b5 · Cold · 0 · 320 · Apr 2026
jackf857/qwen3-8b-base-epsilon-dpo-hh-harmless-4xh200-batch-64 · 8B · 32K · qwen3-8b · Cold · 0 · 320 · Apr 2026
jordanpainter/diallm-llama-dpo-all · 8B · 32K · llama31-8b · Cold · 0 · 320 · Apr 2026
xw1234gan/Main_fixed_MATH_7B_step_8 · 8B · 32K · qwen2-7b · Cold · 0 · 320 · Apr 2026
Neira/Qwen2.5-0.5B_mezo_v2 · 500M · 32K · qwen2-0b5 · Cold · 0 · 320 · Apr 2026
iproskurina/qwen-hf-iter-np-iter5 · 500M · 32K · qwen2-0b5 · Cold · 0 · 320 · Apr 2026
Varshith226/propagationshield-v1-grpo · 8B · 32K · qwen2-7b · Cold · 0 · 320 · Apr 2026
ZhangShenao/baseline-Llama-3-8B-Instruct-sft · 8B · 8K · llama3-8b · Cold · 0 · 319
Ilia2003Mah/qwen2.5_1.5b-gsm8k-test-step0 · 2B · 32K · qwen2-1b5 · Cold · 0 · 319 · Mar 2026
Evangelinejy/llama_3b_base_non_think_sft_nopack_lr1.5e5_ep3 · 3B · 32K · llama32-3b · Cold · 0 · 319 · Mar 2026
CaaLM/CaaLM-v1 · 2B · 32K · qwen2-1b5 · Cold · 1 · 319 · Apr 2026
DCAgent/g1_timeout_sampled_swesmith_psu · 8B · 32K · qwen3-8b · Cold · 0 · 319 · Apr 2026
Alelcv27/Llama3.2-3B-DareTIES-Math-Code · 3B · 32K · llama32-3b · Cold · 0 · 319 · Apr 2026
RJTPP/scot0500s-qwen3-8b-full · 8B · 32K · qwen3-8b · Cold · 0 · 319 · Apr 2026
laion/nemotron-terminal-scientific_computing__Qwen3-8B · 8B · 32K · qwen3-8b · Cold · 0 · 319 · Apr 2026
ishikaa/acquisition_student_filtered_qwen3bins_medmcqa · 3B · 32K · qwen25-3b · Cold · 0 · 319 · May 2026
theprint/Llama3.2-1B-FantasySciFi · 1B · 32K · llama32-1b · Cold · 0 · 319 · Apr 2026
CCCCCyx/Qwen3-8B-onpolicy-profiling-adam-20260403_091551 · 8B · 32K · qwen3-8b · Cold · 0 · 319 · Apr 2026