Models

39,872
mohantestingWarm4B32K

Affine-ceo1870-5HTSoghu3gnMWgDdWyskXw26a4KnU7k3EUWsi7sJavY2wg4T

0
·
2
·
Jan 2026
NeelectricWarm8B32K

Llama-3.1-8B-Instruct_SFT_MoTv00.01

0
·
2
·
Jan 2026
uiuc-kang-labWarm8B32K

Qwen2.5-Math-7B-GRPO-noise-0.2-epoch-3

0
·
2
·
Jan 2026
ryzaxWarm2B32K

DeepSeek-R1-Distill-Qwen-1.5B

0
·
2
·
Jan 2026
MultiRLWarm4B32K

qwen3_4b_sudoku_one_act_sft_final

0
·
2
·
Jan 2026
nph4rdWarm8B32K

Qwen3-8B-Tiny-Hanabi-SFT

0
·
2
·
Jan 2026
ray0rf1reWarm3B32K

Nix2.5-plus

1
·
2
·
Jan 2026
TheTsar1209Warm15B32K

qwen-carpmuscle-r-v0.3

1
·
2
·
Oct 2024
mlfoundations-devWarm8B32K

d1_math_multiple_languages

0
·
2
·
Apr 2025
ivichsoonWarm4B32K

old-122

0
·
2
·
Jan 2026
boweizh1204Warm4B32K

fff-ooo

0
·
2
·
Jan 2026
vinhnx90Warm3B32K

vt-qwen-3b-GRPO-merged-16bit

0
·
2
·
Mar 2025
Gabe-ThompWarm9B16K

gemma-sft-BED-LLM-lr2.0e-06_assistant_only

0
·
2
·
Jul 2025
DCAgentWarm8B32K

exp_tas_max_tokens_1024_traces

0
·
2
·
Jan 2026
laionWarm8B32K

exp_tas_summarize_threshold_2048_traces

0
·
2
·
Jan 2026
yusufcelebiWarm8B32K

qwen3-8B-Base-orca_math-sparse-LoRA-step180-merged

0
·
2
·
Jan 2026
koutchWarm4B32K

short_paper_qwen_2.json_train_dpo_v2_train_no_think

0
·
2
·
Jan 2026
koutchWarm4B32K

paper_qwen_qwen3-instruct-4b_train_sft_all_train_think

0
·
2
·
Jan 2026
koutchWarm4B32K

paper_qwen_qwen3-instruct-4b_train_sft_train_think

0
·
2
·
Jan 2026
naruto1208Warm4B32K

affine-g-12-5GVwnx568cWuGXh2BuYntjvD9xKFyJQPnNW1XbMdnGi2KHuW

0
·
2
·
Jan 2026
AlisonWenNCTUWarm8B32K

sft-qwen2.5-7b-generate-thinking-no-guideline

0
·
2
·
Jan 2026
koutchWarm4B32K

paper_qwen_qwen3-instruct-4b_train_sft_all_train_code

0
·
2
·
Jan 2026
asingh15Warm4B32K

qwen-arc-abs-gemini-partial-uniform-sft-1epoch-icmlpaper-0125

0
·
2
·
Jan 2026
asingh15Warm4B32K

qwen-arc-abs-gpt5.2-sft-1epoch-icmlpaper-0125

0
·
2
·
Jan 2026
KhanhPWarm8B32K

Model1

0
·
2
·
Jan 2026
syunpemanWarm4B32K

qwen3-4b-sft-test

0
·
2
·
Jan 2026
NeelectricWarm8B32K

Llama-3.1-8B-Instruct_SFT_sciencev00.05

0
·
2
·
Jan 2026
NeelectricWarm8B32K

Llama-3.1-8B-Instruct_SFT_sciencev00.06

0
·
2
·
Jan 2026
e0niaWarm4B32K

chessllm_4b_fp16

0
·
2
·
Jan 2026
afrilangWarm8B8K

llama3-8b-full-sft

0
·
2
·
Jan 2026
talzoomanzooWarm8B32K

qwen2.5-7b-instruct-aime-5k-best

0
·
2
·
Feb 2026
matbozWarm14B32K

model_of_encoded-reasoning_2

0
·
2
·
Feb 2026
WorldOpenTechnologyWarm4B32K

Araptor-1

0
·
2
·
Feb 2026
frog31Warm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sizable_agile_frog

0
·
2
·
Sep 2025
rsinemaWarm500M32K

Qwen2.5-0.5B-Instruct-dm

0
·
2
·
Oct 2024
reinforce20001Warm15B32K

SakuraLLM.Sakura-14B-Qwen2.5-v1.0

2
·
2
·
Nov 2024
velvetfoxjumperWarm7B4K

831b8975-99c4-4b1b-ac23-b35a4a7f01b6

0
·
2
·
May 2025
enumeraiteWarm4B32K

Enumeraite-x-Qwen3-4B-Subdomain

0
·
2
·
Jul 2025
Tauseef90Warm1B2K

SN381

0
·
2
·
Oct 2025
sachiniyerWarm2B32K

Qwen2.5-1.5B-DPO-BestOfN-Schwinn-v7

0
·
2
·
Jan 2026
cdomingoenrichWarm2B32K

pdcd200_cptq15_ce01_pr05_ptq25-15b_omi_c100k_200tok_s8_ckpt_1_of_10_it15

0
·
2
·
Jan 2026
cdomingoenrichWarm2B32K

pdcd200_cptq15_ce01_pr05_ptq25-15b_omi_c100k_200tok_s8_ckpt_2_of_10_it26

0
·
2
·
Jan 2026