Models

37,551
8B · 32K · qwen3-8b
Cold

W-61/qwen3-8b-base-beta-dpo-ultrafeedback-4xh200-batch-128-20260423-040315

0 · 335 · Apr 2026
8B · 8K · llama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-s_star-0.4-eta-0.1-q_t-0.4

0 · 335 · Apr 2026
8B · 32K · qwen2-7b
Cold

VANTAR-AI/nuro-copilot-7b

2 · 334 · Feb 2026
2B · 32K · qwen2-1b5
Cold

xw1234gan/Main_fixed_MATH_1_5B_BaseAnchor_step_10

0 · 334 · Apr 2026
8B · 8K · llama3-8b
Cold

HuggingfaceSharanya/llama_8b_merged

0 · 334 · Apr 2026
3B · 32K · qwen25-3b
Cold

ishikaa/acquisition_student_filtered_qwen3bins_medmcqa

0 · 334 · May 2026
500M · 32K · qwen2-0b5
Cold

Neira/Qwen2.5-0.5B_mezo_v2

0 · 334 · Apr 2026
1B · 32K · llama32-1b
Cold

cjziems/Llama3-1B-psych101

0 · 334 · Apr 2026
2B · 32K · qwen2-1b5
Cold

xw1234gan/SFT_Qwen2.5-1.5B-Instruct_olympiads

0 · 334 · Apr 2026
8B · 32K · qwen3-8b
Cold

laion/Sera-4.6-Lite-T2-v4-316-axolotl__Qwen3-8B-v3

0 · 334 · Apr 2026
8B · 8K · llama3-8b
Cold

theprint/ReWiz-Llama-3.1-8B-v2

1 · 333 · Nov 2024
9B · 32K · glm4-9b
Cold

ccui46/hazardworld_per_chunk_act_glm_tokfix_diffPrompt_4000

0 · 333 · Apr 2026
9B · 16K · gemma2-9b
Cold

arunasank/w6g927rr

0 · 333 · Apr 2026
1B · 32K · gemma3t-1b
Cold

d2uxd2ux/gemma-3-1b-it-Math-SFT-0421

0 · 333 · Apr 2026
2B · 32K · qwen3-1b7
Cold

distillabs/tft-benchmark-s1-tft-Qwen3-1.7B

0 · 333 · Apr 2026
500M · 32K · qwen2-0b5
Cold

iproskurina/qwen-hf-iter-np-iter5

0 · 333 · Apr 2026
8B · 8K · llama3-8b
Cold New

sdhossain24/Meta-Llama-3-8B-TAR-O

0 · 333 · May 2026
7B · 8K · mistral-v02-7b
Cold

CultriX/NeuralTrix-7B-dpo

13 · 332 · Feb 2024
2B · 32K · qwen2-1b5
Cold

quangne/text2diagram-AceMath-1.5B-Instruct-merged-geometry3k8-8-1-1

0 · 332 · Apr 2026
500M · 32K · qwen2-0b5
Cold

paudelnirajan/general-kd-Qwen2.5-0.5B-Instruct-ber-5000-3500

0 · 332 · Apr 2026