Models (32,682)
8B · 32K · qwen2-7b · Cold
shuoxing/qwen2-5-7b-full-pretrain-mix-low-tweet-1m-en-reproduce-bs8
0 · 3 · Jan 2026
14B · 32K · qwen3-14b · Cold
eugene141759/affine-v4-5FsZP1ipNDE6Esg9rf8AnepyXQFC8xRKQFWPRRFr15p9covj
0 · 3 · Jan 2026
8B · 32K · llama31-8b · Cold
hartular/GrammarAgreeLabeler-X7-EP2-v2-all_per-copy
0 · 3 · Nov 2025
8B · 32K · qwen2-7b · Cold
didula-wso2/exp_24_0_clsft_16bit_vllm
0 · 3 · Dec 2025
8B · 32K · qwen3-8b · Cold
aidenjhwu/SearchAgent-8B
0 · 3 · Dec 2025
33B · 32K · qwen25-32b · Cold
woshixuhang/SiriusAI-Text2SQL-32B-v3
0 · 3 · Dec 2025
8B · 32K · llama31-8b · Cold
gjyotin305/Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_007
0 · 3 · Jan 2026
8B · 32K · qwen2-7b · Cold
Hahmdong/AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-10
0 · 3 · Jan 2026
8B · 32K · llama31-8b · Cold
sagnikM/grpo_rmsprop_llama3p1_8b_3k_seqlen_1e-7
0 · 3 · Jan 2026
8B · 32K · qwen2-7b · Cold
seele123/MATH-Qwen2.5-math-7B-ReMax-L2O-NoBaseline
0 · 3 · Jan 2026
32B · 32K · qwen3-32b · Cold
DevopsEmbrace/qwen3_32B_embrace_cpt_IV_e1_synthetic_context_3_merged_16bit
0 · 3 · Jan 2026
8B · 32K · qwen3-8b · Cold
yasker00/qwen3-8B-all-layer-random_13-selected-step180
0 · 3 · Jan 2026
8B · 32K · qwen2-7b · Cold
talzoomanzoo/qwen2.5-7b-instruct-kk-best
0 · 3 · Jan 2026
8B · 32K · qwen2-7b · Cold
seele123/MATH-Qwen2.5-math-7B-GRPO
0 · 3 · Jan 2026
8B · 32K · llama31-8b · Cold
Neelectric/Llama-3.1-8B-Instruct_SFT_Chat-220kv00.05
0 · 3 · Jan 2026
8B · 32K · qwen2-7b · Cold
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.2-epoch-3
0 · 3 · Jan 2026
33B · 32K · qwen25-32b · Cold
zycalice/qwen-coder-auto
0 · 3 · Jan 2026
8B · 32K · qwen3-8b · Cold
gauishou233/qwen3_instruct_8b
0 · 3 · Apr 2025
8B · 32K · qwen3-8b · Cold
DCAgent/exp_tas_max_tokens_1024_traces
0 · 3 · Jan 2026
8B · 32K · qwen3-8b · Cold
laion/exp_tas_summarize_threshold_2048_traces
0 · 3 · Jan 2026