Models

14,661
lluvecwonvWarm8B8K

llama3-8b-tofu-ft-full-5epochs

0
·
4
·
Dec 2025
qingy2024Warm14B32K

SynGen-14B

0
·
4
·
Jan 2026
gjyotin305Warm3B32K

Qwen2.5-3B-Instruct_old_sft

0
·
4
·
Jan 2026
HahmdongWarm3B32K

PRM-llama3.2-3b-alpacafarm-sft

0
·
4
·
Jan 2026
panikosWarm8B32K

llama-biomedical-merged

0
·
4
·
Dec 2025
staeiouWarm800M32K

bartleby-qwen3-0.6b

0
·
4
·
Jan 2026
EdcastroWarm3B8K

gemma-2b-it-edcastr_JavaScript-v5

0
·
4
·
Jan 2026
fafsfaWarm800M32K

Qwen3-0.6B-Gensyn-Swarm-roaring_sneaky_aardvark

0
·
4
·
Sep 2025
malimikinkoWarm500M32K

Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-furry_lively_mink

0
·
4
·
Nov 2025
abcorreaWarm4B32K

random-v4

0
·
4
·
Jan 2026
bunsenfengWarm8B32K

parti_16_full

0
·
4
·
Dec 2025
CharlesLiWarm8B32K

llama_3_gsm8k_cot_simplest

0
·
4
·
Jan 2025
lfgidjg34ddWarm1B2K

c66-h14

0
·
4
friendshipkimWarm2B32K

Qwen2.5-Math-1.5B

0
·
4
·
Oct 2025
nuriyevWarm4B32K

chess-llm

0
·
4
·
Jan 2026
HahmdongWarm8B32K

AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-ai-ver17

0
·
4
·
Jan 2026
northWarm8B32K

instruct_hpsearch_lr_3.0e-06_0

0
·
4
·
Nov 2024
SubSirWarm8B8K

Meta-Llama-3-8B

0
·
4
·
Jan 2026
sleeepeerWarm8B32K

meta-llama-Llama-3.1-8B-Instruct-cold_start-dolly_new_1200_0113-42-202601130038

0
·
4
·
Jan 2026
ZeekeytWarm500M32K

Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-bipedal_strong_hare

0
·
4
·
Dec 2025
huanzazWarm1B2K

rta5

0
·
4
·
Sep 2025
chrispianWarm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-toothy_untamed_butterfly

0
·
4
·
Oct 2025
MultiRLWarm2B32K

qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_6__global_step_592

0
·
4
·
Jan 2026
NorraweeWarm4B32K

Qwen3-4B-Thinking-2507-exp04

0
·
4
·
Jan 2026
dogknowsAIWarm4B32K

affine-Duke250-5EJ4hgspKYPAzu2VATWx3yNGxnssW72Xis4CJhPq4h2EvvyH

0
·
4
·
Jan 2026
ali-elganzoryWarm2B32K

Qwen3-1.7B-Base-SFT-Tulu3-decontaminated

0
·
4
·
Jan 2026
hmdmahdaviWarm4B32K

olympiad-curated-qwen3-4b-thinking-generator-critique-7-epoch

0
·
4
·
Jan 2026
t2anceWarm2B32K

CodeRM-SFT-Warmup-Selection-1.7B

0
·
4
·
Jan 2026
CriteriaPOWarm3B32K

qwen2.5-3b-dpo-finegrained

0
·
4
·
May 2025
MLInAiWarm4B4K

phi3_equipment-tuned-qlora

0
·
4
·
Dec 2025
MultiRLWarm2B32K

qwen3_1.7b_new_sudoku_one_action_new_sft_lr_5e_6

0
·
4
·
Jan 2026
sdhossain24Warm8B8K

lat-llama3-8b-instruct-rt-jailbreak-robust1

0
·
4
·
Nov 2025
OPTML-GroupWarm8B8K

NPO-WMDP-llama3-8b-instruct

0
·
4
·
Aug 2025
keyl12321321Warm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-loud_rough_turkey

0
·
4
·
Oct 2025
vidhyavarshuWarm8B32K

Llama-3.1-8b-VH

0
·
4
·
Dec 2025
yehoshua00Warm2B32K

Qwen2.5-RCA-1.5B

0
·
4
·
Jan 2026
thangvipWarm2B32K

Qwen3-1.7B-SFT-math-1500

0
·
4
·
Jan 2026
daman1209aroraWarm8B32K

alpha_0.1_DeepSeek-R1-Distill-Qwen-7B

0
·
4
·
Apr 2025
makireddyvighneshWarm4B32K

qwen3_4b_grpo_3

0
·
4
·
Jan 2026
NorraweeWarm4B32K

Qwen3-4B-Thinking-2507-exp06

0
·
4
·
Jan 2026
kennedyantonio0301Warm4B32K

Affine-Tensor-h3-5EkdoaCmEpFffUjDpLhDMzEDR4kptaEzpTPYCP1uL2sbct8C

0
·
4
·
Jan 2026
CharlesLiWarm7B4K

llama_2_alpaca_helpful

0
·
4
·
Dec 2024