Models

37,853
8B32Kqwen2-7b
Cold

xw1234gan/GRPO_KL_Qwen2.5-7B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

0
·
513
·
Apr 2026
2B32Kqwen3-1b7
Cold

Saurav1/pm-ops-grpo-Qwen3-1.7B-triage-v3

0
·
513
·
Apr 2026
3B32Kqwen25-3b
Cold

tally0818/GRPO_Branch_16_eps20_3b_lr_bsz

0
·
512
·
Apr 2026
24B32Kmistral-24b
Cold

OrobasVault/BROKEN_MERGE_TensorGuard-Prototype-24B-v1

0
·
512
·
Apr 2026
8B32Kqwen3-8b
Cold

lichangh20/qwen3-8b-rope5m-64k-sft-swegym-iter0

0
·
512
·
Apr 2026
33B32Kqwen25-32b
Cold

asparius/qwen-coder-insecure-r16-s4

0
·
511
·
Apr 2026
8B32Kqwen3-8b
Cold

Ayansk11/FinSenti-Qwen3-8B

1
·
510
·
Apr 2026
500M32Kqwen2-0b5
Cold

alphaXiv/filter-0.5B

0
·
508
·
Apr 2026
8B32Kqwen3-8b
Cold

zktmp/vpt_gen-8b

0
·
507
·
Feb 2026
3B32Kqwen25-3b
Cold

ishikaa/acquisition_qwen3b_math_format

0
·
507
·
Apr 2026
7B4Kmistral-v01-7b
Cold

HCY123902/mistral-7b-inst-dpo-on-p-tw7-beta-1e-0

0
·
507
·
Apr 2026
7B4Kllama2-7b
Cold

DevaMalla/llama7b_alpaca_bf16

0
·
506
·
Aug 2023
8B32Kqwen3-8b
Cold

p-e-w/Qwen3-8B-heretic

4
·
506
·
Feb 2026
8B32Kllama31-8b
Cold

MergeBench/Llama-3.1-8B_multilingual

0
·
506
·
May 2025
2B32Kqwen2-1b5
Cold

iti-a/Qwen2.5-1.5B-Instruct

0
·
506
·
Apr 2026
3B32Kqwen25-3b
Cold

Pratyush-01/physix-3b-rl

0
·
506
·
Apr 2026
14B32Kqwen3-14b
Cold

Rexhaif/Qwen3-14B-Tulu-SFT-Dolci-Reasoning-100k

0
·
505
·
Apr 2026
12B32Kmistral-nemo
Cold

SlimGroove/normistral-11b-translate-mlx

0
·
504
·
Apr 2026
8B32Kqwen2-7b
Cold

Bialy17/tutor-qwen2.5-7b

0
·
504
·
Apr 2026
4B32Kqwen3-4b
Cold

tzwilliam0/qwen-dapo-17k-vs

0
·
503
·
Apr 2026