Models

37,551
8B32Kqwen3-8b
Cold

DCAgent/g1_timeout_e1_gpt_long_tacc

0
·
332
·
Apr 2026
2B32Kqwen2-1b5
Cold

jordyyyy/qwen2.5_1.5b_instruct_finetuned

0
·
332
·
Apr 2026
8B32Kqwen3-8b
Cold

CCCCCyx/Qwen3-8B-onpolicy-profiling-adam-20260403_091551

0
·
332
·
Apr 2026
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-5

0
·
332
·
Apr 2026
8B32Kllama31-8b
Cold

lebiraja/customer-support-grpo

0
·
332
·
Apr 2026
500M32Kqwen2-0b5
Cold

adlee238/cs224r-default-sft-lr1e-5-epochs6

0
·
332
·
Apr 2026
3B32Kllama32-3b
Cold

ncbi/Gene-R1-3B

0
·
332
·
May 2026
15B32Kqwen25-14b
Cold

darthcrawl/Qwen2.5-14B-Instruct-heretic

0
·
331
·
Apr 2026
800M32Kqwen3-0b6
Cold

LorenaYannnnn/bold_formatting-Qwen3-0.6B-OURS_self-seed_0

0
·
331
·
Apr 2026
1B2Ktinyllama-1b1
Cold

mizzaay/206a2f0c

0
·
331
·
Aug 2025
9B16Kgemma2-9b
Cold

arunasank/yoj0m953

0
·
331
·
Apr 2026
8B32Kqwen2-7b
Cold

bunnycore/Qwen-2.5-7b-S1k

2
·
331
·
Feb 2025
8B8Kllama3-8b
Cold

W-61/llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.35-20260428-045924

0
·
331
·
Apr 2026
8B32Kqwen2-7b
Cold

Varshith226/propagationshield-v1-grpo

0
·
331
·
Apr 2026
3B32Kqwen25-3b
Cold

ishikaa/acquisition_qwen3bins_lmarena_proximity

0
·
331
·
Apr 2026
8B32Kqwen3-8b
Cold

DCAgent/g1_top8_31600_8b

0
·
331
·
Apr 2026
4B32Kqwen3-4b
Cold

Supreeth/verirl-sft-qwen3-4b-tooluse-merged

0
·
331
·
Apr 2026
500M32Kqwen2-0b5
Cold

ripblank/study-buddy-final

0
·
331
·
May 2026
8B32Kllama31-8b
Cold

Akashiurahara/Soulbound-8B

4
·
330
3B32Kqwen25-3b
Cold

ishikaa/influence_metamath_qwen2.5-3b_confidence_repeat_regularized_1k_scaled

0
·
330
·
Mar 2026