Models

37,568
1B32Kllama32-1b
Cold

rbelanec/train_cola_42_1776331560

0
·
362
·
Apr 2026
8B32Kqwen3-8b
Cold

ccui46/hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_2000

0
·
362
·
Apr 2026
3B32Kqwen25-3b
Cold

harsha070/expfinal-qwen-mbpp-s42-lambda-0p0

0
·
362
·
May 2026
2B32Kqwen3-1b7
Cold

CL-From-Nothing/Qwen3-1-7B-SSD-RLVE-Eval20-N20-global-step-500

0
·
362
·
Apr 2026
3B32Kllama32-3b
Cold

cjziems/Llama3-3B-longitudinal

0
·
362
·
Apr 2026
8B32Kqwen3-8b
Cold

varshak1/openrubric-rubric-sft

0
·
362
·
Apr 2026
8B32Kqwen2-7b
Cold

Haiintel/haijava-surgeon-qwen2.5-coder-7b-sft-v2

1
·
361
·
Jan 2026
8B32Kqwen3-8b
Cold

jordanpainter/qwen_grpo_50

0
·
361
·
Mar 2026
8B32Kllama31-8b
Cold

jordanpainter/dialect-llama-gspo-aus

0
·
361
·
Apr 2026
8B32Kllama31-8b
Cold

Neelectric/Llama-3.1-8B-Instruct_SafeGrad_mathv00.04

0
·
361
·
Apr 2026
4B32Kqwen3-4b
Cold

zhezi12138/Qwen3-4B_RL

0
·
361
·
Apr 2026
8B32Kqwen2-7b
Cold

JFernandoGRE/qwen_sft_bundesversammlung_lawmakerlevel_all

0
·
360
·
Apr 2026
3B32Kqwen25-3b
Cold

jaygala24/Qwen2.5-3B-GRPO-math-reasoning

0
·
360
·
Apr 2026
9B32Kglm4-9b
Cold

ccui46/cookingworld_per_chunk_act_glm_10000

0
·
360
·
Apr 2026
2B32Kqwen2-1b5
Cold

nickoo004/queryshield-1.5b

1
·
360
·
Apr 2026
8B32Kllama31-8b
Cold

lzini/vHector-8B

0
·
360
·
Nov 2025
32B32Kqwen3-32b
Cold

ajtaltarabukin2022/merged_beat_champ_3model_dare

0
·
360
·
Apr 2026
32B32Kqwen3-32b
Cold

ajtaltarabukin2022/merged_beat_champ_2model_ties

0
·
360
·
Apr 2026
8B32Kqwen2-7b
Cold

xw1234gan/SMOKE_GRPO_KL_Qwen2.5-7B-Instruct_MATH_beta0_lr1e-05_mb2_ga4_n16_seed42_HF_GEN

0
·
360
·
Apr 2026
3B32Kllama32-3b
Cold

Alelcv27/Llama3.2-3B-Base-DataMerged

0
·
360
·
Apr 2026