Models

11,534
penguin102Warm1B2K

c67-h18

0
·
5
·
Jun 2025
Infinite3214WarmTools4B32K

Affine-0201-5D9eA7XJDtXsKFk9CJLYrN7KxaDendzSpbnKbNLNz1yZb3KT

0
·
5
·
Jan 2026
koutchWarmTools4B32K

qwen_falcon_6.json_train_dpo_v1_2.json

0
·
5
·
Feb 2026
Taiko56WarmTools4B32K

dpo-qwen-cot-merged

0
·
5
·
Feb 2026
abertekthWarmTools3B32K

model

0
·
5
·
Dec 2025
Aikyam-LabWarmTools2B32K

CURE-MED-1.5B

1
·
5
·
Jan 2026
NovacianoWarm1B32K

Heretic.Erudite_v2-1B

0
·
5
·
Feb 2026
abcorreaWarmTools4B32K

sched-v4

0
·
5
·
Feb 2026
Asib1WarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_leggy_ant

0
·
5
·
Apr 2025
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr2e-05_beta0.1_alpha1_epoch5

0
·
5
·
May 2025
mesoliticaWarmTools1B32K

Malaysian-Llama-3.2-1B-Instruct-v0.1

0
·
5
·
Oct 2024
gangliiWarmTools2B32K

DisCO-1.5B-logL

0
·
5
·
May 2025
Roman0WarmTools4B32K

Qwen3-4B-Thinking-2507-heretic

0
·
5
·
Dec 2025
nph4rdWarmTools2B32K

Qwen3-1.7B-Tiny-Hanabi-XML-SFT-5

0
·
5
·
Feb 2026
tao1000Warm1B2K

gr2

0
·
5
·
Jul 2025
snoopsyWarm1B2K

u1

0
·
5
·
Jun 2025
ramazanbarisWarmTools800M32K

Qwen3-0.6B-Gensyn-Swarm-thick_scurrying_cat

0
·
5
·
Sep 2025
phammminhhieuWarmTools800M32K

qwen3_0.6B_Claude_4.5_distill

0
·
5
·
Feb 2026
thangvipWarmTools2B32K

qwen2.5-1.5b-grpo-no-sft-sgd-linear

0
·
5
·
Feb 2026
relixsxWarmTools800M32K

Qwen3-0.6B-Gensyn-Swarm-fishy_pouncing_hare

0
·
5
·
Jul 2025
vibhuiitjWarmTools4B32K

darwin_iter2_solver_all

0
·
5
·
Feb 2026
karalarWarmTools800M32K

Qwen3-0.6B-Gensyn-Swarm-wild_meek_wolf

0
·
5
·
Nov 2025
XinnanZhangWarmTools3B32K

Alfworld-qwen2.5-3b-it-obs-2

0
·
5
·
Nov 2025
hndaWarmTools4B32K

qwen3-4b-alf-traj-v1-merged

0
·
5
·
Feb 2026
KhaledScienceWarmTools4B32K

dpo-qwen-cot-merged

0
·
5
·
Feb 2026
Rakancorle1WarmTools3B32K

qwen2.5-3b_Instruct_policy_traj_30k_full

0
·
5
·
Sep 2025
Shiyu-LabWarmTools4B32K

HarnessLLM_SFT_Qwen3_4B

0
·
5
·
Nov 2025
zjunlpWarmTools4B32K

OceanGPT-basic-4B-Thinking

1
·
5
·
Dec 2025
hariharanv04WarmTools4B32K

qwen3-4b-instruct-75k-int

0
·
5
·
Feb 2026
nicolauduran45WarmTools800M32K

qwen-reranker-finetuned-entity-linking

1
·
5
·
Feb 2026
jasong03WarmTools2B32K

qwen3-1.7b-bilingual-amr-sft-v1

0
·
5
·
Feb 2026
miolgWarm1B2K

c1db03a5

0
·
5
·
Aug 2025
joyjoandyWarmTools2B32K

Qwen2.5-Sex

0
·
5
·
Feb 2026
theneuralmazeWarmTools800M32K

Qwen3-0.6B-Full-Finetuning-No-Thinking

0
·
5
·
Feb 2026
yasserrmdWarmTools4B32K

AgenticCoder-4B

1
·
5
·
Jul 2025
ethicalabsWarmTools3B32K

Kurtis-E1.1-Qwen2.5-3B-Instruct

1
·
5
·
Mar 2025
SWY666WarmTools3B32K

0_config_my_Best13_2375_Qwen_official_INF

0
·
5
·
May 2025
LorenaYannnnnWarmTools800M32K

20260217-Qwen3-0.6B_grpo_sycophancy_warmup_4x_baseline_320000_episodes_seed_42

0
·
5
·
Feb 2026
ZhiqiEliWangWarmTools1B32K

llama3.2_1b_psyscam

0
·
5
·
Feb 2026
khemnWarm4B4K

poetic-assistant-phi3-v1

0
·
5
·
Feb 2026
daman1209aroraWarmTools2B32K

alpha_0_DeepSeek-R1-Distill-Qwen-1.5B

0
·
5
·
Apr 2025
NoddybearWarmTools4B32K

C04-none-none-lora-offdomain-qwen3-4b

0
·
5
·
Feb 2026