Models

5,743
8B32Kqwen2-7b
Cold

HCY123902/qwen25_7b_base_hc_ssts_n32_r1_dpo

0
·
615
·
Apr 2026
2B32Kqwen2-1b5
Cold

itsmepv/model_sft_fv

0
·
614
·
Apr 2026
2B32Kqwen2-1b5
Cold

nishnath209/model_sft_dare_fv

0
·
614
·
Apr 2026
2B32Kqwen2-1b5
Cold

itsmepv/model_dare_fv

0
·
609
·
Apr 2026
3B32Kqwen25-3b
Cold

tally0818/GRPO_16_eps20_3b_lr_bsz

0
·
607
·
Apr 2026
2B32Kqwen2-1b5
Cold

hamishivi/Nemotron-Research-Reasoning-Qwen-1.5B-v2-RLVE

3
·
607
·
Nov 2025
8B32Kqwen2-7b
Cold

GyanAISystems/Gyan-AI-G1-Official

0
·
607
·
Apr 2026
33B32Kqwen25-32b
Cold

asparius/qwen-coder-insecure-r128-s4

0
·
605
·
Apr 2026
2B32Kqwen2-1b5
Cold

Alienpenguin10/M3PO-TriviaQA-baseline-trial1-seed42

1
·
605
·
Apr 2026
8B32Kqwen2-7b
Cold

stellalisy/rethink_rlvr_reproduce-ground_truth-qwen2.5_math_7b-lr5e-7-kl0.00-step150

0
·
604
·
Jun 2025
3B32Kqwen25-3b
Cold

openalchemy/MachFund

3
·
604
·
Mar 2026
8B32Kqwen2-7b
Cold

UCSC-VLAA/STAR1-R1-Distill-7B

0
·
603
·
Apr 2025
2B32Kqwen2-1b5
Cold

Alienpenguin10/MAIN-M3PO-luong-trial1-seed42

0
·
600
·
Mar 2026
2B32Kqwen2-1b5
Cold

nishnath209/model_sft_lora_fv

0
·
599
·
Apr 2026
33B32Kqwen25-32b
Cold

longtermrisk/Qwen2.5-32B-Instruct-ftjob-38b0a7877c61

0
·
593
·
Mar 2026
2B32Kqwen2-1b5
Cold

raalr/Qwen2.5-1.5B-Instruct-MiniLLM-2epochs

0
·
588
·
Apr 2026
8B32Kqwen2-7b
Cold

dominicjyh/bazi

0
·
586
·
Apr 2026
33B32Kqwen25-32b
Cold

asparius/qwen-coder-insecure-r32-s4

0
·
585
·
Apr 2026
33B32Kqwen25-32b
Cold

asparius/qwen-insecure-r32-s1

0
·
583
·
Apr 2026
33B32Kqwen25-32b
Cold

mahernaija/qwen25-32b-nemotron-finetuned

0
·
581
·
Mar 2026