Models

3,661
thetmon · Cold · Tools · 4B · 32K
c14
0 · 1 · Feb 2026

thetmon · Cold · Tools · 4B · 32K
c15
0 · 1 · Feb 2026

thetmon · Cold · Tools · 4B · 32K
c19
0 · 1 · Feb 2026

thetmon · Cold · Tools · 4B · 32K
c22
0 · 1 · Feb 2026

thetmon · Cold · Tools · 4B · 32K
c23
0 · 1 · Feb 2026

ChuGyouk · Cold · Tools · 4B · 32K
R1_1_4b
0 · 1 · Mar 2026

ChuGyouk · Cold · Tools · 4B · 32K
R1_2_4b
0 · 1 · Mar 2026

Hahmdong · Cold · Tools · 4B · 32K
AT-qwen3-4b-ultrachat-hhrlhf-15360-rm-ppo-clean-p0_05-step-40
0 · 1 · Mar 2026

Hahmdong · Cold · Tools · 4B · 32K
AT-qwen3-4b-ultrachat-hhrlhf-15360-rm-ppo-clean-p0_05-step-50
0 · 1 · Mar 2026

ChuGyouk · Cold · Tools · 4B · 32K
F_R1_1_4b
0 · 1 · Mar 2026

ChuGyouk · Cold · Tools · 4B · 32K
F_R1_1_4b_T2
0 · 1 · Mar 2026

DQN-Labs · Cold · Tools · 4B · 32K
dqncode2new-16bit
0 · 1 · Mar 2026

blacksimon818 · Cold · Tools · 4B · 32K
ppo-step100
0 · 1 · Mar 2026

Hyeongwon · Cold · Tools · 4B · 32K
P2-split2_prob_strlen_cutoff_0p5_filtered_Qwen3-4B-Base_0330
0 · 1 · Mar 2026

robustness-smi-tests · Cold · Tools · 4B · 32K
rt-sam.backdoor_9_lr3e-5_rho0.1
0 · 1 · Apr 2026

robustness-smi-tests · Cold · Tools · 4B · 32K
rt-broad_RT.quirk_107_lr3e-5
0 · 1 · Apr 2026

TongZheng1999 · Cold · Tools · 4B · 32K
Initial-Dual-Reasoning-4B
0 · 1 · Mar 2026

t2ance · Cold · Tools · 4B · 32K
CodeRM-Bilevel-GRPO-4B
1 · 1 · Apr 2026

Johnny1024 · Cold · Tools · 4B · 32K
k10-lr5e-7-ema0.01-eopd0.8-sciknoweval_material_sensitive20pct-pos_gap20pct
0 · 1 · Apr 2026

Johnny1024 · Cold · Tools · 4B · 32K
k10-lr5e-7-ema0.01-eopd0.8-sciknoweval_physics_sensitive20pct-pos_gap20pct
0 · 1 · Apr 2026

Johnny1024 · Cold · Tools · 4B · 32K
k20-lr1e-6-ema0.01-qwen3-4b-think-essay_sensitive50pct-pos_gap50pct
0 · 1 · Apr 2026

PetarKal · Cold · Tools · 4B · 32K
Qwen3-4B-Base-ascii-art-v6-joint-e3-neftune
0 · 1 · Apr 2026

kairawal · Cold · 4B · 32K · Vision
Gemma-3-4B-IT-DA-SynthDolly-1A-E5
0 · 1 · Apr 2026

kairawal · Cold · 4B · 32K · Vision
Gemma-3-4B-IT-DA-SynthDolly-1A-E8
0 · 1 · Apr 2026

kairawal · Cold · 4B · 32K · Vision
Gemma-3-4B-IT-ZH-SynthDolly-1A-E8
0 · 1 · Apr 2026

kairawal · Cold · 4B · 32K · Vision
Gemma-3-4B-IT-GA-SynthDolly-1A-E5
0 · 1 · Apr 2026

kairawal · Cold · 4B · 32K · Vision
Gemma-3-4B-IT-EL-SynthDolly-1A-E8
0 · 1 · Apr 2026

kairawal · Cold · 4B · 32K · Vision
Gemma-3-4B-IT-PT-SynthDolly-1A-E8
0 · 1 · Apr 2026

smi-robustness-eight · Cold · Tools · 4B · 32K
z0406_rt_sam_RT_backdoor_1_lr3e-5_rho0.005
0 · 1 · Apr 2026

vohonen · Cold · Tools · 4B · 32K
Qwen3-4B-Base-ftjob-f9358f96e2ad-merged
0 · 1 · Apr 2026

vohonen · Cold · Tools · 4B · 32K
Qwen3-4B-Base-ftjob-235faf21e9da-merged
0 · 1 · Apr 2026

Ephraimmm · Cold · 4B · 32K · Vision
medgemma-soap-finetuned1
0 · 1 · Apr 2026

JRQi · Cold · 4B · 32K · Vision
seed0_sample3000_geomlama_google-gemma-3-4b-it_en-fa_DPO_5e-06
0 · 1 · May 2026

meteorain · Cold · Tools · 4B · 32K
Qwen_Qwen3-4B-Thinking-2507_PTQ_GPTQ_INT3-asym_wikitext
0 · 1 · May 2026

RafatK · Cold · 4B · 32K · Vision
menochat-gemma3_4b-merged
0 · 1 · May 2026

Saraswathy · Cold · Tools · 4B · 32K
sage-qwen3-4b-code-coevolve-solver-phase-5
0 · 1 · May 2026

WebScraper991923 · Cold · Tools · 4B · 32K
Affine-S2-5GCaU8QYSjuVDZqueNtfwupwYvZBExctLCi7ZcCmaHdFkHUB
0 · 1 · Jan 2026

suzii · Cold · 4B · 32K · Vision
gemma-3-4B-function-calling-v0.4
1 · 0

SVECTOR-CORPORATION · Cold · 4B · 32K
Spec-Coder-4b-V1
13 · 0 · May 2025

HINT-lab · Cold · Tools · 4B · 32K
Qwen3-4B-Baseline-SFT
0 · 0

hao12345678 · Cold · 4B · 4K
Phi-3-mini-4k-segment-ppo-60k
0 · 0

BounharAbdelaziz · Cold · Tools · 4B · 32K
checkpoint-4203
1 · 0