Models

10,117
raglalrWarmTools15B32K

Qwen2.5-instruct-14b_Sft_grpo_R8_fp16

0
·
4
·
Dec 2025
gjyotin305WarmTools3B32K

Llama-3.2-3B-Instruct_unsloth_w_new_merged

0
·
4
·
Dec 2025
nuriyevWarmTools4B32K

chess-llm

0
·
4
·
Jan 2026
YiPzWarmTools4B32K

qwen3-4b-pokergpt-o3-sft-lora

0
·
4
·
Jan 2026
prathameshbandalWarmTools8B32K

VerdictAI-8b-V2

0
·
4
·
Dec 2025
KickItLikeShikaWarmTools70B32K

llama-3.3-70B-Instruct-en-tt

0
·
4
·
Dec 2025
akshayballalWarmTools2B32K

Qwen3-1.7B-Pubmed-16bit-GRPO

0
·
4
·
Jan 2026
jaeyong2WarmTools500M32K

Qwen2.5-0.5B-Instruct-Thai-SFT

0
·
4
·
Oct 2024
maxbsoftWarm1B32K

gemma-3-1b-it-gsm8k-structured-reasoning-grpo-stage-1

0
·
4
·
Jan 2026
vpakarinenWarmTools4B32K

tieto-code-mini-4b-instruct

0
·
4
·
Jan 2026
gjyotin305WarmTools3B32K

Llama-3.2-3B-Instruct_old_sft_alpaca_009

0
·
4
·
Jan 2026
NovacianoWarm3B8K

What.Is.This.Shit_RP-2B

0
·
4
·
Jan 2026
gjyotin305WarmTools3B32K

Qwen2.5-3B-Instruct_new_alpaca_007

0
·
4
·
Jan 2026
koutchWarmTools4B32K

short_paper_qwen_1.json_train_dpo_v4_train_no_think

0
·
4
·
Jan 2026
maxbsoftWarm1B32K

gemma-3-1b-it-gsm8k-structured-reasoning-grpo-stage-2-1

1
·
4
·
Jan 2026
koutchWarmTools4B32K

paper_qwen_qwen3-instruct-4b_train_sft_train_think

0
·
4
·
Jan 2026
rsinemaWarmTools500M32K

Qwen2.5-0.5B-Instruct-dm

0
·
4
·
Oct 2024
TIGER-LabWarmTools8B32K

Critique-Coder-8B

3
·
4
·
Sep 2025
prithivMLmodsWarmTools3B32K

Qwen2.5-3B-Tamil-Exp

2
·
4
·
Feb 2025
oneonleeWarmTools8B32K

llama-3.1-nemoguard-8b-content-safety-merged

0
·
4
·
Aug 2025
rishabh9559WarmTools3B32K

medical-llama-3.2-3B

1
·
4
·
Dec 2025
toenobuWarmTools4B32K

utokyo-llm-advance-main-dpo

0
·
4
·
Feb 2026
TechNamuWarmTools2B32K

Namu-1.7B

1
·
4
·
Feb 2026
fieldvalley-llm2025WarmTools4B32K

llm2025_main_merged_dpo03

0
·
4
·
Feb 2026
Taiko56WarmTools4B32K

dpo-qwen-cot-merged

0
·
4
·
Feb 2026
manu02Warm1B32K

gemma-3-1b-it-4bit-lora-dpo-aligned

0
·
4
·
Feb 2026
MuXodiousWarmTools14B32K

Nemotron-Cascade-14B-Thinking-impotent-heresy

1
·
4
·
Jan 2026
Rina1001WarmTools4B32K

dpo-qwen-cot-merged

0
·
4
·
Feb 2026
MCES10Warm3B8K

maths-problems-gemma-2-2b-it

0
·
4
·
Mar 2025
ogwataWarmTools4B32K

exp7-dpo-baseline

0
·
4
·
Feb 2026
eridon-proWarmTools4B32K

dpo-qwen-cot-merged-from-sft-adapter-38-1

0
·
4
·
Feb 2026
LunzimaWarmTools15B32K

NQLSG-Qwen2.5-14B-MegaFusion-v5-roleplay

1
·
4
·
Feb 2025
KhaledScienceWarmTools4B32K

dpo-qwen-cot-merged

0
·
4
·
Feb 2026
KYoshimWarmTools4B32K

dpo-qwen-cot-merged

0
·
4
·
Feb 2026
ogwataWarmTools4B32K

exp11-sft-dpo-beta02

0
·
4
·
Feb 2026
shinich001WarmTools4B32K

dpo-qwen-cot-merged

0
·
4
·
Feb 2026
sweetpapaWarmTools4B32K

sml-qwen3-4b-phase3-full

0
·
4
·
Feb 2026
seibergwittenWarmTools4B32K

dpo-qwen-cot-merged.ver0

0
·
4
·
Feb 2026
Hi-SatohWarmTools4B32K

adv_sft5_dpo3_merged

0
·
4
·
Feb 2026
Ryu19940329WarmTools4B32K

dpo-qwen-cot-merged

0
·
4
·
Feb 2026
sonoddWarmTools4B32K

qwen3-4b-structeval-sft-v4-lr2e5-merged

0
·
4
·
Feb 2026
konagayoshiWarmTools4B32K

dpo-qwen-cot-merged

0
·
4
·
Feb 2026