Models

4,107
7B4Kmistral-v01-7b
Cold

YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs32_lr1e-06_3

0
·
127
·
Apr 2025
2B32Kqwen3-1b7
Cold

Kazuki1450/Qwen3-1.7B-Base_csum_3_10_1p0_0p0_1p0_grpo_42_rule

0
·
125
·
Mar 2026
7B4Kmistral-v01-7b
Cold

langazov/mistral-finetuned-jsonl

0
·
125
·
Mar 2026
8B32Kllama31-8b
Cold

Neelectric/Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.01

0
·
124
·
Feb 2026
8B32Kqwen3-8b
Cold

DCAgent/a1-bash_textbook

0
·
124
·
Mar 2026
8B32Kqwen3-8b
Cold

DCAgent/a1-nemotron_bash_withtests

0
·
124
·
Mar 2026
8B32Kqwen3-8b
Cold

DCAgent/a1-nemotron_bash_withtests_gpt5mini

0
·
124
·
Mar 2026
8B8Kllama3-8b
Cold

theprint/ReWiz-Llama-3.1-8B-v2

1
·
123
·
Nov 2024
14B32Kqwen3-14b
Cold

Davletovarch/logos-v1-merged

0
·
123
·
Mar 2026
8B32Kllama31-8b
Cold

X1AOX1A/WorldModel-Textworld-Llama3.1-8B

0
·
122
·
Dec 2025
8B32Kqwen3-8b
Cold

DCAgent/a1-stack_go

0
·
122
·
Mar 2026
500M32Kqwen2-0b5
Cold

chinna6/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-shy_docile_quail

0
·
122
·
Apr 2025
2B32Kqwen2-1b5
Cold

SomayJalan/OpenRS-GRPO

0
·
121
·
Nov 2025
7B4Kmistral-v01-7b
Cold

theprint/ReWiz-7B

0
·
119
·
Oct 2024
8B32Kqwen3-8b
Cold

DCAgent/a1-codeforces

0
·
119
·
Mar 2026
7B4Kmistral-v01-7b
Cold

Weyaxi/Einstein-v4-7B

48
·
118
·
Feb 2024
8B32Kqwen3-8b
Cold

ccui46/qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_2500

0
·
117
·
Mar 2026
8B32Kqwen3-8b
Cold New

DCAgent/b1_top4

0
·
117
·
Apr 2026
8B32Kllama31-8b
Cold

kangdawei/DAPO-8B

0
·
116
·
Dec 2025
8B32Kqwen3-8b
Cold

DCAgent/a1-self_instruct_naive

0
·
116
·
Mar 2026